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FOREWORD 


Dr. Hattwick’s methods of measurement appear reliable for 
either experimental or practical use by the first year of school. His 
findings present further evidence for the early appearance and 
slow growth of pitch discrimination ability in children. But in spite 
of increased true singing ability, children become noticeably diffi- 
dent or resistant by the fourth grade level. 

This work is in sequence with Professor Williams’ previous Sta- 
tion study entitled The Measurement of Musical Development. 


Сбкокве D. STODDARD 


Office of the Direetor 

Iowa Child Welfare Research Station 
University of Iowa 

January 9, 1935 


Bureau Едп!. Psy. Research 
DAVID НАНЕ TRAINING COLLEGE 


CONTENTS 


CHAPTER 


II. 


TII. 


ту. 


PART ONE. А GENETIC STUDY OF DIFFERENTIAL 
PITCH SENSITIVITY 
MELVIN S. HATTWICK 
General Orientation and Scope of the Investigation 
General Orientation 
Scope of the Study 
Experiments in Methodology: The , Binging “Technique 
Historical Summary 
Problems in the raaton of iis пен ве“ Technique 
The Musieal Laboratory 
Interval Singing Experiments 


Summary of Interval Singing, а as a Technique. for Measuring 
Pitch Discrimination 


Experiments in Methodology: The Verbal Concept Technique 
Subjeets and Method , . . . б . 7 
Group Testing 5 
Individual Tests 
Group and Individual Test Results 
Choice of Intervals 
Number of Trials : 
Experimental Evaluation of Test 
Objective Criteria of Test Comprehension 
Results 
Value of the Present Group Test for Children in Fifth Grade 
and Under . 5 
Tentative Norms 
Group Diserimination XU 
Summary . T d 


Genetie Growth of Differential Pitch Sensitivity 
Normative Summary of Differential Reactions to Small Piteh 


Differenees From Four to Ten Years of Age 
Implieations of Results 
References 


PART TWO. MANUAL OF INSTRUCTIONS AND INTER- 
PRETATIONS FOR A PITCH DISCRIMINATION TEST FOR 
YOUNG CHILDREN 
MELVIN S. HATTWICK 
Materials Needed 
Instruetions 
Computation and Dorsten of Results 
Individual Tests of Piteh Diserimination for Young Children 
Disadvantages of the Pitch Test 


PAGE 


71 
71 
72 
73 
74 


6 IOWA STUDIES IN CHILD WELFARE 


CHAPTER Pace 


PART THREE. A NOTE REGARDING THE PSYCHOPHYS- 
ICAL ANALYSIS OF PITCH DISCRIMINATION IN YOUNG 


CHILDREN 
Навого M. WILLIAMS and MELVIN S. Hartwick 
Determination of Group Thresholds . з . ‘ 77 
Fit of Group Data and Normal Probability Tategral a E E 79 
Distribution of Individual Thresholds . « = . * з 79 
Referenees E . 5 Е . Я я . . . . 84 


PART FOUR. IMMEDIATE AND DELAYED MEMORY OF 
PRESCHOOL CHILDREN FOR PITCH IN TONAL 


SEQUENCES 
Накогр M. WILLIAMS 

Experimental Conditions А s 3 x у Е ы E 87 
Method of Scoring 5 = 5 š 5 у 4 є г 88 
Learning Curves . . s . 2 89 
Correlations Between Immediafe dui Delayed Recall . Е . 92 
Conelusions . . 3 + $ 5 5 г А P я 93 
References s ы * è 5 s я я Р а T 94 

APPENDIX 
Exaggeration of Intervals s 5 97 
Proeedure and Instruetions for the Second Тиберий Singing’ Test 97 
Typical Procedure and Instructions used in Group Testing . * 98 


A Recorded Test of Piteh Diserimination for Young Children . 99 


РАКТ ОКЕ 
А GENETIC STUDY OF DIFFERENTIAL 
PITCH SENSITIVITY * 
BY: 


MzrviN S. HaTTWICK 


* This study was directed by Dr. Harold M. Williams. 


—X е — _-=—— 


—— ВАН 


CHAPTER I 


GENERAL ORIENTATION AND SCOPE OF THE 
INVESTIGATION 


GENERAL ORIENTATION 


The purpose of this investigation was to study certain aspects of 
piteh sensitivity in young children, partieularly the problem of 
differential responses to small intervals. 

At its lowest level piteh sensitivity has been shown to be ex- 
pressed by changes in a generalized type of body activity (20); at 
its highest level it involves an integration of sensory with cognitive 
processes, measurable by means of verbal responses, as in pitch dis- 
erimination. Between these two extremes development is taking 
place in the sensory and cognitive processes with evidence support- 
ing the view that the former is completed earlier than the latter. 
In a study of differential pitch sensitivity one must consider (1) 
the possibility of finding a useful method of measurement which 
does not necessitate the use of complex conceptual devices, and 
(2) the possibility of revising present methods using the verbal 
concept so that they may be simpler and easier for children to un- 
derstand. The emphasis in the present study is on the latter. 


SCOPE OF THE STUDY 


Possible Methods of Inwestigation 

Of the possible methods for measuring differential reactions to 
pitch, there are four which seem to be most promising. These might 
be termed the interval matching method, the conditioned response 
method, the singing method, and the verbal concept method. 

Interval Matching Method. — The interval matching method, sug- 
gested by Williams (23) on the basis of a teaching device of Mon- 
tessori (14), consists in providing the child with a set of bells sim- 
Паг to those described later in this study (р. 13). The child is in- 
structed to strike these in the same order the experimenter used, 
on the analogy of color matching. The natural interest of such a 
method recommends it for children, but certain important objec- 
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tions are met in the mechanies of the test. For example, unless the 
child strikes the first of the two tones correctly each time, he may 
become confused by hearing a tone he did not expect. The time 
required for the child to respond by striking the bells after the 
test presentation is another disadvantage. Still another is the in- 
ability of the child to strike the bells at the approximate intensities 
of the test interval, a fact whieh would undoubtedly tend to con- 
fuse him as the intervals became smaller. This method was rejected, 
therefore, for the present study. 

Conditioned Response Method. — The conditioned response 
method has one distinet advantage as a measure of pitch diserim- 
ination in that it might permit one to establish a threshold ас- 
eurately for very young children or even infants. Probably the 
greatest disadvantage lies in the prohibitive amount of time neces- 
sary. Furthermore, it would not yield a limen comparable with 
procedures already used with adults. Finally, it might lead to 
nervous indications when very small differences are used, such as 
Pavlov (15) found in working with dogs. This method was also 
rejected for use in the present study. 

Singing Method. — In the singing method the child listens to sim- 
ple intervals under the instruction to sing what he hears. The 
essential requirements of the method are that the child understand 
instruetions and have a certain degree of voco-motor eontrol. Two 
criteria are possible: singing the intervals accurately and singing 
them directionally (that is, following the pitch change in direction 
if not in absolute amount). Since the instructions are simple, no 
serious difficulty may be expected from this source. The second 
requirement, however, presents a more serious problem since a 
lack of voco-motor control necessarily results in failure of the 
method. To the extent that children tend to imitate exactly the 
pitch of the tones a demand is made on the discriminative ability 
of the experimenter. The advantages of the method may be listed 
as (1) simplicity of the instructions, (2) rapidity of the test, and 
(3) the apparent early spontaneous appearance of the ability to 
imitate pitch. 

Verbal Concept Method. — Тп this method two tones are pre- 
sented to the subject, who is required to respond verbally with 
some sign such as ‘‘The last is higher.” This method imposes cer- 
tain requirements on the subject, the most important of which is 
that he understand the concept employed. The ‘‘same-different?’ 
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terminology used by Gilbert (5) and Meissner (12) іп ехрегі- 
ments with children must be considered unsatisfactory because of 
the possibility of judging the two tones on the basis of intensity * 
rather than pitch. The terms ‘‘higher-lower’’ of the Seashore test, . 
if properly comprehended, are clear. А point which may be raised 
against this terminology, however, is the possibility of confusion as 
to which of the two tones is to be judged in terms of the other. In 
addition, one must make certain that the terms ‘‘high’’ and "ом"? 
have been properly associated in the pitch continuum. 

Williams (23) has suggested a terminology of ‘‘going upstairs, 
going downstairs’’ which, after preliminary investigation by the 
writer, has been shortened to ‘‘going up — going down.’’ From the 
standpoint of logie and simplicity in comprehension this terminol- 
ogy has certain advantages for use with children. The phrases 
«своте up — going down”? are simple enough, at least in the visual 
field, to be understood by praetieally every normal child from 
five years оп." The word ‘‘going’’ suggests a directional idea each 
time and helps the child realize that he should judge the last tone 
in relation to the first without the necessity of repeating this dis- 
tinetion. 

On a logical basis, the greatest difficulties of the concept method 
have to do with the comprehension of terms. These difficulties are 
counterbalanced to a large extent by such advantages as (1) quick- 
ness of test procedure; (2) the possibility of establishing a clear 
criterion for judging; (3) the fact that this method, if perfected, 
gives the commonest conception of a limen in this field; and (4) the 
general musical validity of such a technique. 

Summary. — From the survey of possible methods, those of sing- 
ing and verbal concept seem to be the least objectionable and to 
show the most possibilities. These two, therefore, have been chosen 
particularly for study. The experimental data to be presented in 
this research may be listed as follows: 

With regard to the singing procedure 

1. Degree of resistance to singing at various age levels under different 
conditions of motivation 
. Percentage of children at various age levels able to reproduce inter- 
vals directionally 


3. Effect of practice оп the responses 
4. Comparison of results from large and small intervals 


to 


116 is practically impossible to insure equal intensity for every tone of every pair, 
although practice may minimize the difficulty. In the Seashore record the intensities were 


intentionally varied in в random way. 
2 This statement will be shown to be true under the section on page 39 in Chapter IV. 
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With regard to the concept procedure 
1. Effect of factors such as concept terminology, pace of test, type of 
recording on group and individual test scores 
2. Effect of practice on the responses 
3. Percentage of children testable at various age levels by group and 
individual methods 
In addition, a standard procedure for group testing has been 
evaluated and tentative norms derived. Finally, а descriptive nor- 
mative summary is given based on all the results of experiments to 


date. 


Sources of Data 


The children serving as subjects in this study were enrolled in 
the following schools: (1) the four- and five-year-old groups of the 
preschool laboratories at the Iowa Child Welfare Research Station 
at Iowa City, Iowa; (2) the first and second grades of the Univer- 
sity Elementary School at Iowa City; (3) the seeond to fifth grades 
inclusive of St. Patrick's Parochial Sehool at Iowa City; and (4) 
the first to fifth grades inclusive from twenty-six elementary 
schools in the Des Moines Public School System at Des Moines, 
Towa. 

The total number of children on whom experimental data are 
presented in the following sections is 3,902. Тһе age range of these 
children was from three years, six months to twelve years, four 
months, 


Apparatus 


One of the major difficulties in testing the pitch discrimination of 
children lies in the fact that a simple, reliable, and interesting 
(from the child’s standpoint) means of producing tones is difficult 
to find. Up to the time of this investigation the sound sources for 
pitch tests have been tuning forks, variable pitch pipes, and Vic- 
trola records such as the test of pitch discrimination in the Sea- 
shore series (recorded by the Columbia phonograph company). 
McGinnis (11) has reported unfavorably on the use of the Sea- 
shore records, except with modifications, for testing pitch dis- 
crimination of children. Seashore (18), Allen (1), and Halverson 
(reported by Town (21)) have used the tuning fork technique 
with children, the first two investigators having had the children 
report by concept and the latter by singing. All three attribute 
their difficulties more to the method of testing than to any inade- 
quacies of sound production. Gilbert (5) and Meissner (12) used a 


em 
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variable piteh pipe as a source of sound. In a recent study Hissem 
(8) reports having used scientifically tuned chimes or gongs in 
testing children's ability to diseriminate simple tones or rhythms. 

The sound-producing apparatus for the present investigation 
consisted of a vibrating bar hung over a resonator, which was 
tuned to the bar. By striking the bar with a small rubber hammer 
a pure tone is produced. The bars were made by the Deagan com- 
pany and are similar to those found on dinner gongs. They were 
tuned to the appropriate pitches * by the investigator for the test- 
ing series. 

The resonators were made from ordinary one and one-half inch 
brass tubing. One end was sealed by a brass cap and a three-fourths 
inch hole was cut from the side of the tube halfway from the ends. 
The other end was closed by means of a cork with a smooth inner 
surface of cardboard which would reflect well. By means of the 
adjustable cork the length of each resonator could be adjusted so 
that each would “вреак”” at a maximum when the bar was struck. 
The general construction was identical with the usual dinner gong. 
The bells could be easily handled by the experimenter at the same 
time that responses from the subject were being recorded. Exten- 
sive practice in striking and damping the tones reduced to a min- 
imum the possibilities of any large differences in the length or 
intensity of the sounds. 

И method was used in tuning the bars. Reliability between three observers in 
counting beats varied from .961 = .002 to .928 + .006 for various rates of beating tones, 
Oscillographic records were taken of the tone produced when the 442 4. у. bar was struck 
medium loud (as used in testing) and very loud. Harmonic analysis (Henrici analyzer) 
showed that in the medium loud tone 98.6 per cent of the total energy was contained in 
the fundamental component, while in the very loud tone 98.4 per cent of the total energy 
was found in the fundamental. The remaining energy, 1.4 per cent and 1.6 per cent re- 


spectively, Was distributed over the remaining nine partials in very small proportions. 
(Each wave was analyzed through the first ten partials.) 


CHAPTER II 


EXPERIMENTS IN METHODOLOGY: THE 
SINGING TECHNIQUE 


HISTORICAL SUMMARY 


Seashore (19) has suggested that piteh differences are exaggerat- 
ed when reproduced in singing. Miles (13), and later Williams 
(22), confirmed this statement with respect to adults. Halverson 
(reported by Town (21)), however, was the first and, until the 
present investigation, the only one to utilize the suggestion in a 
test with children, 

Halverson asked fifty-three children, ages five to seven inclusive, 
to sing the sounds they heard (tuning forks placed before resona- 
tors being used as sound sources). “Тһе frequently occurring ex- 
aggerations of interval by the children, if in the right direetion, 
were taken to mean that the children really appreciated the dif- 
ference in pitch. The least difference in pitch that the child dis- 
tinguished with certainty was recorded as the зеоге.”? ((21), p. 18) 
He concluded that **the variations in the findings are so great that 
the group norm is useless, The seores vary from 2 to 17 double 
vibration differences, with 13 total failures. ”? ( (21), р. 18-19) 

The number of failures (25 per eent) and the variability in the 
Seores, however, may have been due to a variety of faetors. These 
might have included failure to exaggerate the differences enough 
(which in very small interval differences could easily be confused 
with failure), inattention, disinterest, or fatigue, since it cannot 
be determined from the published report how many trials were 
given each child. 

In general, errors due to subjective estimation of responses when 
using the singing technique almost always yield results that are 
unfair to the child. Nevertheless, since the instructions in the 
method are easily understood at young age levels and since re- 
Sponses are apparently readily given in most cases, the technique 
can hardly be dismissed on the adverse but inconclusive evidence 
presented by Halverson. 
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PROBLEMS IN THE EVALUATION OF THE INTERVAL 
SINGING TECHNIQUE 


It has seemed worth while to investigate the singing method more 
thoroughly. The following questions have accordingly been given 
experimental consideration in this section: (1) To what extent is 
there emotional resistance to singing? (2) What per cent of those 
responding satisfactorily сап reproduce the intervals in the correct 
direction? (3) Is exaggeration of intervals, if found, great enough 
to permit reliable subjective estimates of direction as such inter- 
vals grow smaller? (4) What are ің йесін of minimum practice on 
success ? ? 

Since the singing technique would seem to have its greatest use- 
fulness at the lower age levels, the presentation of data in this 
seetion follows the order of inereasing age. 


THE MUSICAL LABORATORY 


To provide adequate motivation in this experiment, the investi- 
gator arranged a musical laboratory for younger children in which 
he had a number of musical instruments including a xylophone, a 
number of organ pipes both wood and metal, Chinese gongs, 
whistles, bells, and so forth. 

The child was brought into this laboratory and allowed to play 
with the musical materials while the experimenter made sugges- 
tions and answered questions. At an appropriate time the sugges- 
tion was made that both the experimenter and the subject play a 
game with the bells which were sitting on a table. After a prelim- 
inary introduction to the test in this manner, the child was taken 
away with the promise of another game on the following day. The 
performance of the children on these preliminary trials was never 
included in the experimental data. 


INTERVAL SINGING EXPERIMENTS 


Each ehild was told to sit quietly, shut his eyes, and listen to the 
bells and as soon as the bells stopped to sing ''ding-dong"' just like 
the bells. The experimenter sang several trials for demonstration 
and then tested the child.* 

Eighteen four-year-old and twenty-four five-year-old children 
(ages four years, two months to five years, eleven months) from 

4 Recording was done by the experimenter on special blanks in all individual testing 


throughout the study. After some practice he became quite proficient in ringing the bells 
with the right hand and recording with the left. 
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the presehools of the Iowa Child Welfare Research Station served 
as subjects in this experiment, 
First Main Interval Singing Test 

In this test twenty-five trials were given. The intervals used (26, 
14, and 8 d. у.) were chosen merely for preliminary orientation 
and it was not expeeted that they would really approaeh threshold 
value. The direction of the intervals on each trial (whether they 
should be **going пр” or **going down") was determined original- 
ly by random choice and plaeed on a key. 

"The results of this test are found in the following tabulation: 

Age, Years 
Children 4 5 Total 


Num- Per Num- Per Num- Per 
ber Cent ber Cent ber Cent 


Tested 16 24 40 
Responded for twenty-five trials 12 75 17 71 29 73 
Singing direetion correetly in 90 per 

cent of trials 3 25 12 70 15 52 
Refusing to sing but accepting а 

verbal method of response 3 19 5 21 8 20 
Refusing to sing or respond verbally 1 6 2 8 3 8 


These results are very similar to the results found in a shorter 
test in whieh but a single large interval was used. The proportion 
of success in directional singing was small. It seems fairly conclu- 
Sive, therefore, that voco-motor control presents the fundamental 
diffieulty of this technique at those ages. 

Since one-fourth of the children refused to sing іп this experi- 
ment, it was impossible to determine what pereentage of the total 
group could sing. The following procedure was undertaken, there- 
fore, in an effort to secure responses from every child. 

Ediphone Test of Interval Singing 


This experiment was carried out (1) in an attempt to add more 
motivation, (2) to determine whether the children sang relatively 
accurately or merely directionally, and (3) to determine whether the 
intervals sung were exaggerated to a perceptible extent. 

The experimenter sang **ding-ding"' into the Ediphone horn and 
instrueted the child to sing just as the experimenter had. The child 
was promised that he would be allowed to hear later how elosely 
his voiee sounded like that of the experimenter. 

A flexible proeedure was used in this experiment, the basie prob- 
lem being to determine what proportion of the children could sing, 


5 The abbreviation d. v. refers to double vibrations throughout the study. 
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either on pitch or directionally, with this added motivation and a 
reasonably objective record of response. While the dictaphone is 
not sufficiently accurate to yield precise values by phonophotog- 
raphy, it is accurate enough to give information regarding tenden- 
cies of exaggeration. The accuracy of the experimenter’s ear has 
been previously reported (7). Root (17) has demonstrated the ac- 
curacy of the method of repeated hearings of a recorded stimulus. 

Тһе stimuli were vocal major thirds and seconds. Тһе exaet num- 
ber of trials per child was determined as follows: If a child sang 
at least nine out of the first ten trials in the same direction as the 
test trials sung by the experimenter, there seemed to be little doubt 
that he was testable. If he failed on more than one trial, the test 
was made at least twice as long (generally twenty-five trials) in or- 
der to determine a more accurate percentage of success. 

One hundred twenty-six children from the preschools of the 
Iowa Child Welfare Research Station and the University Elemen- 
tary School of the State University of Iowa were used as subjects. 
These children did not include any of the group tested in the prev- 
ious experiments. The age range was from three years, six months 
to eight years, eight months. 

The Ediphone proved an excellent motivating device; every child 
would sing because each was promised that he would be allowed 
later to hear if his voice sounded like the experimenter’s. Table 1- 
and Figures 1 and 2 show the results. There is a progressive in- 
crease in voeo-motor control with an increase in mental and ehron- 


TABLE 1 
"Relationship of Voeo-Motor Control to Chronologieal and Mental Ages 
Per Cent Per Cent 


Per Cent |Singing Ac-|Singing Only Per Cent 
Age, Chil- Who Would |eurately (90|Directionally| Exaggerat- 
Years dren Sing Per Cent |(90 Рег Cent} ing Inter- 
Success) Success) vals 
Chronological Age 
4 21 100 0 5 5 
5 26 100 13 23 19 
6 21 100 14 48 48 
7 35 100 38 47 47 
8 23 100 26 44 44 
Mental Age 
4 9 100 0 9 9 
5 10 100 0 10 10 
6 93 * 100 4 13 10 
Т Ag ? 100 5 36 36 
8 94 100 42 42 42 
9 27 100 37 45 45 
10 16 100 29 64 64 
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CHRONOLOGICAL AGE 


Figure 1. Relationship of Voeo-Motor Control to Chronological Age 
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MENTAL AGE 


Figure 2. Relationship of Voco-Motor Control to Mental Age 
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ological age. The correlations of mental age and chronological age 
with ability to reproduce simple intervals directionally are as #01- 
lows: 


Age r РЕ, Eta RE 
Chronological 505 .052 -612 043 
Mental 424 .056 541 048 


These correlations аге low but significant. They suggest that some 
combination of general maturity and possibly understanding of 
directions or coöperation is at work. 

From Table 1 it may be seen also that those who sang the inter- 
‘vals in the correct direction almost always exaggerated such inter- 
vals. This is in agreement with previous experimental data on 
adults. However, a considerable number sang accurately and hence 
showed no exaggeration. Sinee there were so few successes on either 
eriterion at these early ages, the data on this point are inconclu- 
sive. Failure to exaggerate in some cases may be due to inadequate 
voco-motor control at this level. At the later ages, such failure is 
due, іп part, to increased accuracy in singing intervals on pitch. 

Aetual piteh diserimination tests given to twenty-two children 
who eould and would sing what they heard give additional but still 
inadequate information on exaggeration of intervals. In these tests 
the majority of the children did not exaggerate the intervals of 
the test. At the lower intervals (3 and 2 d. v.) the test was stopped 
because the experimenter соша not rely on his subjeetive esti- 
mates. (See Appendix, p. 97) From these data it is evident that the 
singing method is limited in its use, especially in measuring dis- 
erimination of fine intervals. 

Second Main Interval Singing Test 


This test was an attempt to try out the singing method in a school 
whose pupils might more nearly represent the average school pop- 
ulation in mental ability and socio-economic status than did the 
subjeets previously used. The problem here, as in the previous 
interval singing test, was to determine what per cent of children 
in a typieal sehool responded at least direetionally to simple inter- 
vals. In addition, there was an attempt to determine whether any 
improvement would occur after a very short training period. 

In this second interval singing test eighty children from the sec- 
ond to the fifth grades inclusive served as subjects. The age range 

6 The children referred to here were enrolled in St. Patrick's parochial school at Iowa 


City, Iowa. The writer wishes to thank the authorities of the school for their coöperation 
in the testing program. 
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was from six years, eight months to twelve years, five months. The 
bells deseribed earlier were used as sound sources. Instructions and 
procedures were similar to those used in the first main interval sing- 
ing experiment. 

TABLE 2 


Per Cent of Suecesses in Singing a Simple Interval in Correct Direction 
Before and After Minimum Practice 


Per Gent | Per Cent | Per Cent. 
Per Cent | Succeeding | Succeeding | Improve- 


Age, Chil- Who Would |Directionally|Directionally| ment With 
Years dren Sing Before After Minimum 
Training * | Training * | Training 
7 8 100 63 63 

8 15 100 33 40 10 

9 18 100 44 56 20 

10 27 100 19 41 97 

11 12 100 17 17 0 


* 90 per cent success. 


Table 2 gives the results of the experiment. The percentages of 
success in directional singing are fairly comparable, for similar 
chronological ages, to those found in the previous sehool system. 
Тһе frequeney of собрегабіоп, however, fell off markedly at the 
older ages. There was obvious reticence shown by many of the 
children when asked to sing. 

The percentage of improvement with minimum training which 
is recorded here may have been due entirely to the breaking down 
of reticence in singing. Whatever the cause, it may be seen that a 
short training period does have some slight effeet, although the 
total per cent of children testable even after training is less than 
50 per cent at the fifth grade level. 


ТЕ is as important to recognize the limitations of the singing 
арргоаеһ for the higher levels as it is to recognize the limitations 
of the verbal concept approach for the lower levels. The present 
experiment, although very limited in seope, suggests the inadvisa- 
bility of using the singing technique above the first few school 
grades. 


SUMMARY OF INTERVAL SINGING AS A TECHNIQUE FOR 
MEASURING PITCH DISCRIMINATION 


The interval singing technique for testing pitch discrimination 
provides an interesting situation for the young child, uses instrue- 
tions which are easily understood, and requires no great effort in 
response (appearing almost reflexive when present). NES 

S.C.E R.T., West B +" “ел 
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Results from experiments on voco-motor eontrol, however, indi- 
eate three major weaknesses in the method which seriously hand- 
ieap its uses as a measuring instrument: (1) Not all children сап 
reproduce a stimulus directionally. (2) Many sing quite accu- 
rately and do not exaggerate intervals to an observable extent. 
(3) A considerable amount of emotional resistance is encountered. 
The method is valuable largely as a supplementary measure as, for 
example, in cases of early voco-motor control or of late cognitive 
development. 


CHAPTER ІП 


EXPERIMENTS IN METHODOLOGY: THE 
VERBAL CONCEPT TECHNIQUE 


The methods employed in testing discriminative ability of adults 
and older children have almost uniformly incorporated some type 
of verbal response. At the younger ages, however, where mental 
growth and experience are at less advanced stages, one might 
expect to encounter difficulties (3, 6, 16). Differences in cognitive 
discriminative ability may be due to any of the following reasons: 
(1) difficulty in the terminologies themselves, (2) differences in 
difficulty of the same terminology when used in different sense 
fields, and (3) complex instructional situations which are too in- 
volved for the child to grasp, despite the fact that he may under- 
stand the meaning of the terminology used. Factors other than 
difficulties in terminology — pace, motivation, methods of record- 
ing — may also influence the child’s behavior in a test situation 
in which the verbal concept method is used. All of these factors 
must be taken into consideration since the basic essential in a test 
of pitch discrimination is that the child make judgments on the 
basis of pitch differences alone. 

The factors chosen for study in group and individual situations 
in the present experiment, of course, do not exhaust the list of 
possible influences. Instead those of major importance were chosen 
for investigation. Tests were given to comparable groups begin- 
ning with the fifth grade and proceeding downward to a point 
where profitable returns from group tests ceased. To supplement 
the group tests, individual tests were introduced. The general ar- 
rangement of this chapter, therefore, is downward with respect to 


age. 
SUBJECTS AND METHOD 


Group tests were given to 2,957 children from twenty-three ele- 
mentary schools in the Des Moines publie sehool system. Individual 
tests were given to forty-eight children from the elementary schools 
of Des Moines, eighty ehildren from St. Patriek's sehool in Towa 
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City, and 126 children from the Iowa Child Welfare Research 
Station preschools and the University Elementary School. 

In all group tests fifty trials were given at one sitting. Each 
test consisted of introductory remarks and instructions with a 
Short praetiee period. A short pause was made and instruetions 
were repeated after each ten trials. Full typieal instruetions (for 
both the ‘‘going up — going down” and the ‘‘higher-lower’’ ter- 
minologies used in the following comparisons) are given in the 
Appendix (p. 98). 

In individual testing the situation was less formal, and the ex- 
perimenter added words of encouragement when they were appar- 
ently needed. The number of test trials per sitting varied with the 
individual, as few as ten being given in some cases and sixty in 
others. Those who responded to less than twenty-five trials at one 
sitting were brought back later for more testing. The game atmo- 
sphere was always present. If children asked questions about their 
performance, they were answered in an encouraging way; this sit- 
uation frequently occurred. 

By using short tests, breaking longer tests into short tests, and 
making the test a game, a consistent attempt was made to intro- 
duce motivation. In some cases there appeared to be overmotivation 
rather than undermotivation. 


GROUP TESTING 


Evaluation of Two Concept Terminologies 

It was not possible in the present experiment to test out every 
type of verbal concept terminology. Two, however, seemed superior 
to the others. These were the ‘‘higher-lower’’ terminology used in 
the Seashore pitch discrimination test and a ‘‘going up — going 
down’’ terminology. The latter terminology had proved itself 
useful and comparatively easy to understand in preliminary indi- 
vidual tests with younger children; the former had been used in 
group tests as low as the third grade (2). 

The relative effectiveness of each terminology was tested by the 
use of comparable groups. The method of securing and testing 
these groups was as follows: Group A consisted of those children 
seated at the odd numbered desks, while Group B consisted of those 
seated at the even numbered desks.’ Group В children were taken 
to a nearby classroom while the test with ‘‘going up — going 


7 This method of selecting comparable groups was used in all group testing. 
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down’? terminology was given to Group A. At the close of the first 
test the two groups exehanged places and the test was repeated 
but using the ‘‘higher-lower”’ terminology. 

The first four intervals on the Seashore test record of pitch dis- 
crimination were employed as stimuli in this comparison. Ten 
trials each were given on the following intervals in the order 
named: 30, 23, 17, 12, and 30 d. v. These intervals were the easiest 
in the test and were used because they offered the minimum of 
discrimination difficulty. All recording was done by using Form A 
described on page 32. 

А total of 1,202 children from the second, third, fourth, and fifth 
grades of eight elementary schools of the Des Moines publie sehool 
system was used in this experiment. 

Tables 3 and 4 give the results of the comparison. On the whole, 


TABLE 3 


Per Cent Meeting Criterion With Two Terminologies (Going Up — Going 
Down and Higher-Lower) With the Seashore Pitch Record 


Child Per Cent 90 Per Cent 
T todd Completing Successes on 
Cu Test Blank Two Easiest Intervals * 
Е Е Going Up— Higher- 
Grade be Р 5 d Going Down Lower 
во Бо 5 5 toto 5 5 Standard Standard 
дя | we | ЯЯ | më Pe Error of | Рег| Error of 
ET ES co ES |Cent| Per Cent Сепі| Per Cent 
v 157 153 98 97 73 .035 70 .035 
ДУ 140 152 95 96 67 040 66 .039 
III 166 163 98 99 39 .037 44 .039 
II 102 169 87 18 24 042 21 032 


* Twenty trials оп 30 d. v. and ten trials on 23 d. v. intervals. 


TABLE 4 


Mean and Variability of Scores for Two Terminologies (Going Up — Going 
Down and Higher-Lower) With the Seashore Piteh Record 


Standard De- Standard 
Grade Children Mean viation of Dis- Deviation 
tribution of Mean 
Going Up — Going Down 
У 156 42.62 10.26 0.82 
ү. 145 40.71 10.89 0.91 
ПІ 150 35.38 10.92 0.89 
II 97 31.66 10.59 1.08 
Higher-Lower 
v 143 42.07 9.96 
IV 143 37.52 12.99 
TII 149 35.48 12.18 
и 152 33.25 10.05 
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these results show no real differences between the terminologies. The 
mean seores made by each grade are not significantly different 
when using either terminology, as shown in the following tabula- 
tion: (This is computed from Table 4.) 


Ratio of Difference 


Groups Compared il 
Grade V (UD)** and Grade V (HL) 0.47 
Grade IV (UD) and Grade IV (HL) 2.26 
Grade III (UD) and Grade III (HL) 0.07 
Grade II (UD) and Grade II (HL) 1.07 


* For complete reliability, ratio of difference to standard error of 
equal 3.00 or more. 


** UD means "going up— going down." 
HL means "БірбегЛомег," 


Neither do the percentages who seored 90 рег cent correct responses 


on the easiest intervals differ significantly : (This is computed from 
Table 3.) 


difference must 


Ratio of Difference 


` Groups Compared to Standard Error 


of Difference 
Grade V (UD)* and Grade V (HL)** 0.60 
Grade IV (UD) and Grade IV (HL) 0.18 
Grade III (UD) and Grade III (HL) 0.94 
Grade II (UD) and Grade ТІ (HL) 0.71 


* UD means "going up — going down." 
** HL means “higher-lower,”” 
About the same number t 


urned in completed Papers in both in- 
stances, and about the ва 


me number seored 90 per cent correct 
tervals by each method (Figure 8). 

The really surprising results, however, lie in the fact that 27 per 
cent of the fifth grade children scored less than 90 per cent correct 
responses on the easiest intervals, intervals which approach a semi- 
tone in musical value. It seemed hardly possible that such a large 
Percentage was failing because of inability to hear or even to under- 
stand instructions. Some other factor, possibly pace, might have 
been operating to nullify any real differences in comprehension 
which might exist between the two terminologies. Hence ап ех- 
periment was arranged in order (1) to evaluate the two terminol- 
ogies when the factor of pace was controlled and (2) 
the importance of pace itself, 


Experiment for Evaluation of Two Concept Terminologies and the 
Factor of Pace 


The same intervals (30, 23, 17, 12, and 30 d.v.) 
the previous experiment, but instead 


to determine 


were used as in 
of using only the Seashore 
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record of pitch diserimination, another sound source w 
This was the bell series described in Chapter I. 

The bells were tuned to produce exactly the same intervals as 
the Seashore record and were rung in the same order. The only 
difference between the two tests was that the group set its own 
pace in the bell test, while in a comparable group the pace 
the standard instructions of the 
l test the experimenter sat at the 
back of the room and rang the bells. He was able to watch the re- 
imulus pairs only after 
у, signifying readiness 
ese tests were the same 
2 experiment, and the process was very 


as employed. 


е instructed in the use 
' terminology and Groups 2 and 4 
terminology. Groups 1 and 2 took the test 


under the standard Seashore Speed and Groups 8 and 4 set their 


own pace. 


Per Cent Meeting Criterion for 


M oing Down 
and Higher-Lower) Presented at Standard Paee (Seashore Pitch 
Record) and at Slower Variable Р. 


ace (Bells) 
Children | Per pent 90 Per Cent 
Tested |Completing Successes on 
Test Blank Two Easiest Intervals * 
Grade E ЈЕ Going Up— Higher- 
LE 1 58 x Going Down Lower 
EE 5 E Eli ЕН Рег Standard Per Standard 
88 | НЕ ЗЕ БЕ Cent| Error of lc nt Error of 
55 |85158 | 65 Per Cent 1"! Per Cent 
Bells 
У 21 26 95 81 95 048 80 078 
тү 38 49 97 86 92 -044 64 069 
ПІ 38 27 76 85 63 -078 62 .089 
п 84 19 85 67 48 086 22 095 
Victrola 
Ve 22 39 86 8& 77 090 72 
2 зі 079 
тү 88 43 82 88 73 072 74 .067 
ПІ 28 25 64 68 43 094 48 010 
ІТ 84 90 54 20 91 069 10 -067 
* Twenty trials on 30 d. v. and ten trials on 2 


3 d.v. 
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З 5 


PER CENT Successes 
5 


Figure 4. Per Cent Scoring 90 Per Cent Success on Easiest Intervals of Test 
(1) When **Going Up — Going Down?" Terminology Is Combined With 

a Slower Variable Pace and (2) When ‘t Higher-Lower?' Ter- 
minology Is Used With the Standard Victrola Pace 
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"Tables 5 and 6 with the accompanying figure (Figure 4) present 
the results of this experiment. Differences are at once apparent 
between the two speeds of the tests when the ‘‘going up — going 
down”? terminology is used. Mean scores when using the slower, 
variable pace (for both UD and HL terminologies) are higher than 
when using the standard speed with but two exceptions. Further- 
more, mean scores when using a combination of slow, variable pace 
with “going пр — going down" terminology are higher in every 
case than when using the '*higher-lower" terminology with the 
Standard расе. The following tabulation Shows the significance of 
these differences computed from Table 6: 


Ratio of Difference to Standard 
Error of Difference 


Grade У Grade IV Grade III Grade II 
Bells (UD)* and Bells (HL)** 0.23 


Comparisons 


3.15 1.2 1.34 
Bells (UD) and Vietrola (UD) 1.09 2.29 1.56 1.91 
Bells (UD) and Vietrola (HL) 1.41 1.22 0.85 


Bells (HL) and Vietrola (UD) 1.53 0.64 2.70 0.05 
Bells (HL) and Victrola (HL) 1.62 0.98 2.21 
Vietrola (UD). and Victrola (HL) 0.13 1.07 0.30 

7 UD means “going пр — go 

** HL means "higher-lower," 


А larger Percentage of children in every grade completed their 
Papers in the bell test than in the Victrola test when using the 
“going up — going down? terminology. A larger percentage also 
succeeded in Scoring 90 per cent correct responses on the easiest 
intervals when using the “going пр — going down" terminology. 
The significance of differences between the latter Percentages 14 
shown in the following tabulation as computed from Table 5: 


ing down." 


Е Ratio of Difference to Standard 
Comparisons Error of Difference 
Grade У Grade IV бтайе ПІ Grade IT 
Bells (UD)* and Bells (HL)** 1.63 3.41 “0.08 203 
Bells (UD) and Victrola (UD) 


1.67 3.8 s 2.46 

Bells (UD) and Vietrola (HL) 3.19 ae 2 3.49 

Bells (HL) апа Vietrola (UD) 0.25 0.90 147 0.09 

Bells (HL) and Vietrola (HL) 1.08 1.04 1.48 0.66 

Vietrola (UD) and Vietrola (HL) 0.49 0.10 0.51 1.15 
* UD means "going Up— going down," 


** HL means "higher-lower,'" 


The most significant difference in this ex 
slower variable pace with the ‹ 
inology and the standard Vietrg] 


periment is that between 
“going up — going down” term- 
а pace using the ""higher-lower?? 
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In general, both the results from mean Scores and from per- 
centages sueceeding on the test indicate that the “going up — going 
down'' terminology at a reduced paee (that is, with use of the 
bells) is the best method of testing, being effective with over 90 per 
cent of the fourth and fifth grades and with 50 per cent or more 
of the second and third grades. Thus it is apparent that both 


terminology and paee are important factors in determining pitch 
responses. 


Before leaving this section it is interesting 
standard pace was used with both the ""воїпе up — going down’? 
and ‘‘higher-lower’’ terminologies, the percentages scoring 90 
per cent successes on the easiest intervals in both this and the pre- 
ceding experiment are very similar. (Compare Tables 4 and 6.) 
Effect of Method of Recording on Test Performance 


In group tests there is another factor which may have an im- 


portant influence on performance; this is the manner in which the 
child records his responses. 


In the present study it w. 
Tows running across rathe 
instructions since the chil 


to note that, when the 


r than down the page. This simplified 
“во across the page 


trial row was Separated from 
enough to prevent confusio: 
row he was using. 
Methods of Recordin 
verbal response, 


the following row by a space large 
n on the part of the child as to which 


$ (x) in one of two 
drawing a line along one 
of two sounds (that is, 
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large number of cases was secured because of the number of com- 
parisons involved. 

In the letter method the subjects were told to record H, for ex- 
ample, if the second of two tones was higher than the first, or L 
if it were lower. In the cross method they were told to put a cross 
(x) in the higher box (demonstrating) if the last sound was higher 
or, if the last sound was lower, to put a cross in the lower box. 
Both the ‘‘higher-lower’”’ and ‘‘going up — going down” termin- 
ologies were used with each recording form. 

Tables 7 and 8 show the results of this comparison. The follow- 
ing tabulation contains data on the significance of differences be- 
tween mean scores made when using letters and crosses: (This is 
computed from Table 8.) 


Ratio of Difference to Standard 
Error of Difference 


Comparisons 
Grade У GradeIV GradeIII Grade ТІ 


Letters (UD)* and Letters (oly 0.45 1.33 0.26 1.02 
Letters (UD) and Crosses (UD) 0.27 0.24 0.02 0.66 
Letters (UD) and Crosses (HL) 0.43 2.07 0.18 0.35 
Letters (HL) and Crosses (UD) 0.19 1.14 0.26 0.46 
Letters (HL) and Crosses (HL) 0.05 0.76 0.40 1.98 

0.15 1.92 0.15 0.34 


Crosses (UD) and Crosses (HL) 
* UD means "going up — going down." 
** HL means "higher-lower." 


TABLE 7 
Per Cent Meeting Criterion for Two Methods of Recording (Letters "Versus 


Crosses) and Two Terminologies (Going Up — Going Down and Higher-Lower) 
Using the Seashore Piteh Record 


: Рет Сепі 90 Рег Сеп 
Са Completing Successes on 
Tested Test Blank Two Easiest Intervals * 
| Going Up— Higher- 
Grade ЕЁ БЕ Going Down Lower 
Вы | м 55 Standard Standard 
ЯН EE 8 Ei Е Per [Error of nu Error of 
oo ща ©д Ен | Cent | per Cent| ^^^ [Per Cent 
Letters 
Y то т 89 82 68 .052 69 .050 


9 79 73 61 .056 55 .047 
M 5 р 74 2 89 .055 48 .057 


28 .054 29 .055 


II 68 69 68 62 
Crosses 
71 87 85 75 .048 68 .034 
M б 79 83 62 57 044 44 055 
ПІ 18 78 72 68 36 .054 40 .056 
II 36 36 50 62 17 063 21 .068 


* Twenty trials оп 30 d.v. and ten trials on 23 d. v. 
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The significance of differences for percentage successes оп the 
easiest intervals follows: (This is computed from Table 7.) 


Ratio of Difference to Standard 


Comparisons Error of Difference 
Grade У GradeIV GradeIII Grade II 
Letters (UD)* and Letters (HL)** 0.01 1.03 1.21 0.01 
Letters (UD) and Crosses (UD) 0.97 0.94 0.09 1.31 
Letters (UD) and Crosses (HL) 0.00 1.41 0.11 0.84 
Letters (HL) and Crosses (UD) 0.91 0.01 1.28 1.38 
Letters (HL) and Crosses (HL) 0.02 0.21 114 0.96 
Crosses (UD) and Crosses (HL) 0.88 1.49 0.37 0.61 


* UD means "going up — going down." 

** HL means “higher-lower.” 

It is apparent that no significant differences exist between the mean 
seores made or the percentages sueceeding when using letters or 
crosses with either terminology (‘‘going up — going down’’ or 
‘“‘higher-lower’’). There is a slight tendency, however, for more 
correct completions on tests where letters were used. 

An analysis of questions asked by children indieates that the 
'*higher-lower? terminology, when used in conjunction with the 
cross method of recording, was the most difficult. Children’s state- 
ments also indicated that the test was too fast for all groups (the 
Victrola record was used in this comparison). The children pre- 
ferred the cross method to the letter method despite the fact that 
performance was slightly better on the latter. 

Since the letter method showed so little superiority over the 
cross method and since the children preferred the cross method 
(or at least a method where marks rather than letters were used), 
it was decided to compare the letter method with a third possibility. 

Comparison of Letter Method With. Circle and Road Method. — 
In this comparison 424 children, grades two to five inclusive, from 
three Des Moines elementary schools served as subjects. 

Since the data had indicated that the ‘‘going up — going down’’ 
terminology was more effective than the ‘‘higher-lower’’ term- 
inology and that the bells rather than the Victrola record were 
best suited for testing at this level, the present comparison was 
made using the bells for sound and pace and the “воле up — 
going down" terminology for instruction. 

By using the bells it was possible to employ one large interval 
(30 d.v.) throughout the test rather than a series of increasingly 
difficult ones as used in the previous testing comparisons. 

The recording form (Form C) with which the letter method was 
compared is known as the circle and road form. The subjects were 
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instructed to place Ше point of Ше pencil in the little circle and 
listen to the two sounds. If the last sound was going up, they were 
to draw a line upward on the “ар” road; if the last sound was 
going down, they were to move their pencils downward оп the 
*t down?! road. Then they were to put Ше pencil point in the next 
cirele and listen for the next sample. 

Tables 9 and 10 present the results of the comparison. From 
these results it is apparent that in both methods from the second 
grade on praetieally all of the children filled out their papers. 
However, the per cent of children who scored 90 per cent successes 


on the whole test is relatively low. 


TABLE 10 


Mean and Variability of Seores for Two Methods of Recording (Letters Versus 
Cirele and Road) and Two Terminologies (Going Up — Going Down and Higher- 
Lower) at a Slower Variable Pace (Bells) 


Standard De- Standard 
Grade Children Mean viation of Dis- Deviation 
tribution of Mean 
Letters 
У 33 37.81 8.91 1.56 
ТУ 59 40.44 8.55 1.11 
11 63 40.35 9.96 1.26 
II 46 32.72 9.69 1.43 
Circle and Road 
У 33 39.48 11.82 2.07 
ТҮ 61 39.86 11.10 1.42 
ПІ 66 40.29 9.99 1.23 
п 57 32.86 10.02 1.34 


псев between mean seores and between 


The significance of differe 
90 per cent successes for each grade is 


per cent of children scoring 


given below: 
Ratio of Difference to Standard 


Errors of Difference 


Groups Compared in Terms 
For Percentages 


of Letter and Circle For Means 
Grade V 640 с 
320 « 
Grade ТУ Б 
Стаде ПІ .003 .640 
Жер 070 380 


Grade II 


While a larger percentage 
children scored 90 per cent su 


of second, third, and fourth grade 
ecesses on the whole test when using 


ing the same intervals tended to lower 
have entered into the differences found 
that they came from school 


816 may be that the monotony caused by us 
attention and interest. Another fact which may 
between these groups and previo 
of the city probably lower іп socio-e 


jus groups Was s in a section 
conomie status. 
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the circle and road method than when using the letter method 
(Table 9) and while a larger pereentage of fifth grade children 
was successful when using the letter method, the differences are not 
significant. 

Despite the fact that the differences between methods are small, 
the statements of the children, especially in the lower grades, indi- 
cated a greater interest in the circle and road method than in the 
letter method. It was decided, therefore, on the basis of interest and 
objective scores (that is, per cent successful on the whole test) to 
use the circle and road method in the second, third, and fourth 


grades and the letter method in the fifth 


grade in subsequent 
testing. 


INDIVIDUAL TESTS 


In the section just completed it has been Shown that there were 
many children at the lower levels who failed to understand. There- 


determine whether children who 


hen examined 
y of the complications of the 
tation indicated the easier of 


‘going up — going down," it was 
used in the individual testing which follows. 


The subjects for the individu 
preceding section (р. 23-24). 
Тһе сепігаї question involv: 


determine whether the children understood the concept involved. 


It seemed apparent from observation that the five-year-old child 
in the preschool labor; 


al testing һауе been described in a 


attempt to clarify this question. 
problems were sought : 


1. Do children really understan 
2. Do these same children un 


3. If a verbal conce 
an age as that in the visual field, wh: 
aining on 
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Comprehension of “Сото Up — Going Down" in the Visual Field 
With. Minimal Demonstrational Training 

In this experiment 126 children from the Iowa preschools and 
from the first and second grades of the University Elementary 
School were used as subjeets. The children were brought into the 
testing room individually and seated at a low table. A small two- 
step model was placed before the subject, on each step of which 
was а “Бей” (described in Chapter I). The bell on the lower step 
was a major third lower in the musieal seale than the one on the 
upper step. The experimenter told the ehild to wateh and listen. 
* When I play the bells like this, the sounds are going up the steps, 
aren't they? When I play them like this, they are going down, 
aren't they? Now, like this — which way? That's right. And like 
this? That's right. Now you watch me and listen to the bells and 


tell me whether the sounds are going up or going down. Ready ?”’ 


Each child was then given а short ten trial test, the results of 
whieh are shown in Table 11. These results show that all of the 
children five years or more of age could easily comprehend the di- 
rections while watching and listening and could use the verbal 
concept “коша up — going down’’ perfectly. Although some of 
the responses may have been based on auditory discrimination, it 
seems more probable that vision played the major part in the 
responses. 

Comprehension of "боту Up — Goin i 
Field With Minimal Demonstrational Training 

The second experiment was carried out to determine whether these 
children could use the verbal concept terminology of “going up — 
going down" in the auditory field alone. г 

After the child had been given the short visual demonstration, 
he was told to turn in his chair so that his baek was toward the 
sound source. He was then told to close his eyes ‘во he eould hear 
very well" and was instructed to listen to the bell sounds. If the 
sounds were going up, he was to respond ‘‘going up’’; if they were 
going down, he was to respond ‘‘going down." 


Тһе experimenter rang the bells and recorded responses during 


the test, whieh was twenty-five trials long except for those who 
scored nine out of ten correct responses on the first ten trials. It 


required but a short time to present the test. T 
Table 11 shows the results of this test. These results indicate 


that while in the visual experiment the five-year-old subjects could 


g Down" in the Auditory 
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TABLE 11 


Ability to Use a Simple Verbal Concept in the Fields of Vision and Audition 
as Age Inereases 


Л і Аве, Уеатв 
Children [21516] 7|8 То [10 
Vision 
Chronological Age 
Tested 21 26 21 35 93 
Рег сепі Scoring 100 per cent successes 
on ten trial visual tes: 57 100 100 100 100 
Mental Аде 
Tested 9 10 28 17 за 37 16 
Per cent scoring 100 per cent successes 
on ten trial visual test 44 100 100 100 100 100 100 
Audition 
Chronological Age 
Tested 


21 26 21 35 93 
Рег cent scoring 90 per cent suecesses 
when using verbal concept in audition 10 23 94 80 


Mental Age 
Tested 9 10 95 17 24 977 16 
Per cent scoring 90 per cent successes 


when using verbal concept in audition 0 10 = 18 XT 


78 81 


readily comprehend and use th 
down," in the auditory experi 
year-old group eould use the te 
years of age not all of the subje 

An examination of Table 11 i 
tionship between age and abi 
“going up — going down" 
auditory field, Figure 5 


е terminology ''going up — going 
ment only one-fourth of the five. 
rminology correetly. Even at eight 
ets in the group were successful, 

ndicates that there is a definite rela- 
lity to understand the concept of 
when used in either the visual or the 
shows this relationship clearly. The rela- 
tionship in the auditory field is especially noticeable when mental 


rather than chronological age is considered. It is interesting to note 
that there is a marked j six and seven 
year level in chronologie: 
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ity to respond correctly in the auditory field may be due to a lack 

` of experience and ргас се. Therefore, the following experiments 
were carried out to determine the effect of ргасіїсе in learning a 
verbal сопсері in audition. 

Effect of Minimal Training. — Two groups of children, (1) those 
unable to understand instruetion after a short individual test and 
(2) those unable to Pass a group test because of comprehension 
difficulties,” were given a period of short individual practice. Each 
child was given a combined test-practice-test series in which the 
following procedure was used: (1) eight demonstration trials,!^ 
(2) ten test trials, (3) ten practice trials, (4) ten test trials, (5) 
ten practice trials, and (6) ten test trials. Between each of these 
Series there was a short pause for relaxation. The bells were used 
as sound stimuli, thus permitting the investigator to proceed with 
the test at a расе determined by the child. 


TABLE 12 


Effect of Minimum Individual Practice in Learning a Verbal Concept in Audi- 
tion for Sampling of Children Failing a Group Test 


і Рег Сеп Testable 
Children — 
Стаде Tested Individually After Ten After Twenty 
Individually Without Test And Ten |Test and Twenty 
Practice |РтаеНее Trials | Practice Trials 
Y [j 66 66 66 
ту 12 36 55 55 
ІП 14 17 99 28 
п 16 6 31 


37 
Tables 12 and 13 show the results of this experiment, In the first 
situation, where the child was not given a group test before train- 
ing was begun, Ше per cent of subjects Succeeding in 90 per cent 


of the trials at the Several age levels before and after training is 
relatively low. This may have been due to sampling variations. 
In the latter situation the childr 


trated instruetions, trial tests, and repeated instructions between 
tests in the group situation before they were tested individually. 
Only those were tested individually Whose papers Showed objeetive | 


9 The first group contained chil 
ond group was drawn from Des Moi 


у ng room. If they 
a т Ties, they were tested immediately. {ог pitch 
discrimination, 
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TABLE 13 
Effect of Minimum Individual Ргасбісе in Learning a Verbal Concept in 
Audition 
Е Per Cent Testable 
Children 
Age, Years Tested After Intro-| After Ten After Twenty 
duetory Test and Ten |Test and Twenty 


Practice Trials 


Individually Ei 
1 


ght Trials |Practice Trials 
1 8 13 25 25 
8 15 13 21 27 
9 18 28 39 50 
10 97 7 28 41 
11 12 8 25 33 


evidenee that they did not understand how to perform successfully. 

Table 12 shows that a considerable percentage of children (in- 
ereasing with age) is immediately testable when they enter the 
individual testing room. Their failures in the group situation may 
have been due to recording difficulties, group distraction, and so 
forth. Some of them probably understood the problem toward the 
end of the group test and hence were ready to be tested immediately 
by the time of the individual period. On the other hand, it may be 
seen that a certain percentage іп the second, third, and fourth 
grades did profit from the short individual training period. Whether 
such inereases in percentage testable are worth the effort necessary 
in obtaining such data depends upon the viewpoint of the in- 
vestigator. 

Effect of Extensive Training. — 
ing period did yield a certain percen 
question arose as to whether a longer period of train 
increase this percentage. This problem is taken up in this section. 

In this experiment 126 children from the preschools of the Iowa 
Child Welfare Researeh Station and the University Elementary 
School, ages three years, six months to eight years, five months, 
were given an initial individual test to determine whether they 
understood the ‘‘going up — going down”’ terminology in audition. 
After the initial testing was completed group training was begun. 
This was carried on by the regular music teacher during the daily 
musie hour for a period of five weeks. At the end of the five weeks the 
children were again tested individually. 

Table 14 presents the results of the test both before and after 
the group training period. These results indieate that the greatest 
improvement with extensive training comes between the years five 
and seven, and that prior to and after these years there is little 


Sinee the short individual train- 
tage of improvement, the 
ing might not 
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TABLE 14 
Effect of Extensive Group Practice in Learning a Simple Verbal Coneept in 
Audition 
я Per Cent Testable Per Cent Gain 
Age, Years п Balore After |from Extensive 
Training Training Training 
Chronological Age 
4 16 10 19 9 
5 20 23 40 13 
6 21 24 52 28 
7 30 80 87 7 
8 21 65 71 6 
Mental Age 
ce 5 0 0 0 
5 T. 10 14 4 
6 19 4 16 12 
7 16 18 38 20 
8 24 71 83 12 
9 23 78 91 13 
10 14 81 86 5 


improvement. Tt is interesting to note that during this same period 


the greatest improvement without extensive training also occurs. 

In general, evidence from these training experiments indicates 
that practice as such has relatively little effect on ability to apply 
the “воше up — going down” concept in the auditory field before 
the child has achieved a certain mental status. What improvement 
does occur comes as easily with a minimum of practice as with a 
longer training period after the child is mentally ready. 


Summary of the Verbal Concept Technique as а Testing Method. 


The results from group tests on 2,957 children have indicated 
that various factors have a significant effect on test scores. It has 
been shown that children make b 
tions involving a verbal concept terminology of “going up — going 
down” than when using '*higher-lower.^ They also make better 


Scores when the speed of the test is made variable and set to the 
расе of the group tested. 


Children do not make si 
апу one of the types of г 


etter seores when using instrue- 
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pencil and paper test below the third grade, therefore, is question- 
able. 

In regard to motivation in group tests, the evidence available 
(the children's statements and questions and the number of com- 
pleted papers) indieates that a sustained attention, if not a sus- 
tained interest, was present throughout the tests in the majority 
of cases. Modifications of pace, method of recording, and terminol- 
ogy in themselves supplied certain motivating influences. 

Individual tests of pitch discrimination in which a verbal con- 
cept of ‘‘going up — going down" was used were effective as sup- 
plementary tests for group failures in grades two to five inclusive. 
The percentage of young children four, five, and six years of age 
testable individually by means of the verbal concept decreases rap- 
idly as age decreases. There is a sharp drop from seven to five 
years of age. Neither a minimal nor a long period of training was 
effective in teaching the verbal concept of “going up — going 
down”? in the auditory field, although the same concept was un- 
derstood in the visual field by all children at five years of age. 


CHAPTER IV 


GROUP AND INDIVIDUAL TEST RESULTS 
CHOICE OF INTERVALS 


of 100 trials. 

Data regarding the relative diffieulty of the intervals of the Sea- 
Shore test are to be found in Larson's study (10).? Figure 6 shows 
the mean scores in terms of per cent right for each interval of the 
Seashore piteh diserimination test for the fifth grade. The graph 
was constructed from Larson's data. 

Theoretically the curve should move downward in a more or less 
regular manner, This appears to be true for most cases, although 
some intervals are not as regular as might be expeeted, Since the 
differenee between 8 and 5 d. v. is not as great as that between 5 
and 2 d. у. (Figure 6), one might expect to secure a m 
distributed and possibly a more diseriminative series by narrow- 
ing the 5 d. v. interval to 4 d. v. On the basis об Larson’s 


ore evenly 


NUMBER OF TRIALS 


A relatively short group test is necessary when dealing with 
young children, Statistically, of course, this would not seem de- 
sirable, since shortening a test usually reduces its reliability. 


4T 
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The statistical objection to a shortened test, however, holds only 
under certain conditions, as has been pointed out by Lanier (9) 
and Larson (10). In a study with adults Lanier, for instance, con- 
cluded that the Seashore pitch test might be reduced to one-half its 
present length and still be as reliable as the whole test. Larson (10, 
p. 31) found the test most reliable when seventy trials were used in 
place of 100. 

Previous experimentation in this study indicated that a fifty trial 
test could easily be presented to children if broken into five ten- 
trial series which allowed for repetition of instruction and a brief 
moment of relaxation. A test of greater length was considered in- 
advisable. It was decided, therefore, to construct a test within a 
limit of fifty trials. The final test consisted of five intervals, 30, 17, 
8, 4, and 2 4.у., each of which was to be presented to the children 
ten times in a random order previously determined, beginning with 
the easiest interval and proceeding to the most difficult. 


EXPERIMENTAL EVALUATION OF TEST 


Subjects 

The test was given to 715 children in the first, second, third, 
fourth, and fifth grades of three Des Moines publie schools. The 
children were tested in groups ranging in size from twenty-three to 


thirty-seven. 


Apparatus and Materials 

Six metal bars as described above were tuned to the following 
frequencies: 440, 442, 444, 448, 457, and 470 d.v. (Chapter I). 
set in wood supports were tuned so that 
aximum degree to the 440, 442, 444, 
to the 457 and 


Three resonating cylinders 


two of them responded to аш 
and 448 bars, while the third responded maximally 


470 bars. 

A printed schedule was used. The exact procedures using the 
“going up — going down" terminology and the recording forms 
used during the test are described in detail in the next section of 
this monograph оп “А Piteh Discrimination Test for Young Chil- 
dren.” 


OBJECTIVE CRITERIA OF TEST COMPREHENSION 

On the basis of results already reported in this study and else- 
where, every possible aid was given the children to help them un- 
derstand the instructions of the test. A preliminary illustrated 
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practice period was held before every test. Before a single trial of 
the aetual test was presented, the experimenter asked, “15 there 
anyone who has any questions? Don't be afraid to ask me because 
І want you all to be sure to understand so you can get good scores." 


Not until all the children indicated that they understood was the 
test begun. 


It was felt that, in addition to the above preeautions, an ођјес- 
tive check should be made on Whether or not instruetions were 
understood. To make this check, the first interval in the test was 
purposely ehosen large enough (30 d. V. approximately a musical 
semitone) so that all but the very poorest ears could hear it easily. 
The following eriteria of comprehension were set up: (1) at least 
nine of the first ten trials, all with 30 d. v. intervals, should be 
scored eorreetly (probability of suecess here is 1 in 512); (2) there 
should be no evidence of reversal of instruetions, as for instance five 
or more conseeutive wrong responses on any ten trial series in the 
test; and (3) all papers accepted should be completed. 

Тһе following tabulation gives the proportion of children meet- 
ing the criteria of comprehension : 


Children Children Meeting Рег Cent Meeting 


Grade Tested Criteria Criteria 
У 136 97 71 
IV 157 100 64 
III 204 102 50 
II 153 48 31 
T 65 4 6 


It may be well at this point to compare the present fifth grade 


son (10). The broken line in Figure 8 is 
constructed from Larson's data and shows the mean seores for fifth 
grade children on the various inerements. The dotted line was ob- 
tained similarly from the fifth grade data in the present experiment. 
The solid line was obtained from the same group as that represented 
by the dotted line, after the objective eriterion of comprehension had 
been imposed. 

The difference be 


: tween the curve of Larson and the one of Hatt- 
wick based upon 


what may be called the observational criterion 
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sion have been objectively eliminated. The differences between this 
curve and Larson’s raise the question as to whether the factor of 
comprehension was adequately cared for in her procedures. 

In the present study the objective criterion of comprehension was 
used to make sure that the only tests ineluded in the results were 
those which were fairly valid measures of piteh diseriminative 
ability. The results which follow are based only upon those cases 
passing the objective criterion of test comprehension. 


RESULTS 
Difficulty of the Test Intervals 

Table 15 and Figure 9 show the mean scores m 
on each of the five intervals chosen. 

In each grade the results are fairly satisfactory. With the pos- 
sible exception of the second interval (17 d.v.), which is not as 
difficult as might have been expected, the intervals show relatively 
consistent increase in difficulty. However, the relatively large num- 
ber of successes оп the 17 and 8 d. v. intervals indicates that the 
test might be made better by the removal of one of these intervals 
and the substitution of a more difficult one. 

Table 15 shows that the mean scores on this test for the four 
grades second to fifth inclusive are 41, 42, 42, and 43 (means round- 
ed) out of a possible 50,13 Although these mean scores are higher 
than had been anticipated on the basis of previous experimental 
work, the test does discriminate fairly well as shown by the stand- 
ard deviations. It does not discriminate sufficiently, however, to 
take care of a large number of individual cases; it seems probable 
that a better test would result if, instead of intervals of 30, 17, 8, 4, 
and 2 d. v., a series of 30, 8, 4, 2, arid 1 d. v. was used. 


ade by each grade 


Reliability 


The total number of children 1+ 
prehension (that is, the cases used 
reliabilities obtained 15 


passing the criterion of test сош- 


for reliability purposes) and the 
are given in the following tabulation: 


posed largely of homes of offic 
15 These reliabilities were h 
K е alves method, 8 У la 

being used to determine the hole test (4) eer 
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Children Per Cent Passing 


Grade Tested Criterion т РЕ, 
У 136 71 .761 .028 
IV 157 64 .863 .017 
ПІ 204 50 .879 .015 
Ir 153 31 .837 .027 
І" 65 6 
Total 715 49 


* The meagre returns from the first grade made further consideration of group testing 


at this level impractical. 


The reliability of the test for all grades combined was 789 = .013. 

Тһе reliability of the test for the second, third, and fourth grades 
is relatively high, while that for the fifth grade is not so high as 
might be hoped. The reason that correlations are higher in the 
earlier grades is probably due to the fact that these scores showed 
greater variability; the test was somewhat too easy for the fifth 
graders. 

Since the cases included in these reliabilities quite certainly dis- 
eriminated on the basis of pitch, the test would seem to be a re- 
liable and valid measure of pitch discrimination for the second, 
third, and fourth grades. Before comparing the results with those of 
other studies, it will be worth while to observe the reliabilities which 
would have been obtained had an observational rather than the 
more objective check been made on scores to be included in the 
final results. This information was obtained by determining relia- 


bility for all cases except those showing obvious difficulties in under- 


standing, as evidenced by failure to complete papers, cheating, 
evious inves- 


and so forth, the same сћеск that was employed by рг 
tigators. The following tabulation shows the results: 


Children Рег Cent Usable 


Grade Tested for r r PE, 
№ 136 98 .893 .012 
ту 157 96 .923 .008 
III 204 94 .925 .007 
II 153 67 .861 4017 


When these reliabilities are compared with those above, real 
differences are observable. While the reliabilities are higher accord- 
ing to the latter grouping, it is more probable that the test is, in 
part, a measure of comprehension of instruction and hence a less 
valid measure of pitch discrimination. 4 

АП of these data point to the conclusion that careful wording of 
instructions, adequate practice trials before testing, and eareful 
control of conditions during testing are not enough to insure ade- 
quate comprehension and correct application of test instructions. 
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Even when these conditions are met and the child apparently un- 
derstands instructions, it is still wise to provide an objective check 
on his performance before deciding that the test is a true measure 
of diseriminative ability. 

Effectiveness of Present Test 


It is significant that the number of children testable at the several 
levels drops rapidly as one goes from the fifth to the second grade. 
This is undoubtedly due in part, however, to the rigidity of the test 
criteria of comprehension, 

The criterion of comprehension was purposely made high in order 
to include only those cases for which there was a high probability 
that the children were responding to pitch differences alone. Such a 
criterion excludes the child who, though he did not understand the 
instructions at the beginning, might have understood the test by the 
end of the first or second ten trial series. Since these first twenty 
trials were relatively easy, one might expect a number of instances 
of this type to have occurred. Before judgment can be passed on 
the test as an effective and useful measure at the lower age levels, 
it will be necessary to analyze the test scores further in an effort to 
determine whether the actual percentages testable are not higher 
than suggested here. This information is included in the following 
section, which deals with causes of failure in group tests. 


Causes for Failures in Group Tests 


The major causes for failures in the group test may be ineluded 
under four categories: (1) inability to understand instructions; 
(2) misunderstanding of directions (reversals) ; (3) factors of in- 


attention, disinterest, or fatigue; and (4) inability to discriminate 
piteh differences. 


After the above test w; 
satisfied the eriterion of 
papers still remained whi 


as given and those papers selected which 
comprehension, the following number of 
ch were tentatively classed as failures: 


1 Стаде 
Children Failing У ту ПІ ІП І Total 
Number 28 32 60 82 60 264 
Per Cent 20 32 46 69 93 48 
Inability 19 Understand Instructions. — Ап analysis was made 
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seeond or third ten trial series. The following tabulation contains 


this material: 
Grades 
№ IV III II T 


Children 
Per Cent 


Scoring nine successful responses on first ten 
trial series 
Scoring nine sucecessful responses on second 
or third ten trial series but not on first 5 5 
Total discriminating on at least the last thir- 
ty trials of the test* 85 73 68 44 20 
* No papers are included which contained reversals of concept or which were incom- 
plete, 


If it is assumed that once the children **got Ше idea’’ of the test 
they kept it throughout the period,? a considerable number of 
cases is added to the ranks of those who were really discriminating 
during the important part of the test. 

These last results reflect more accurately the true value of the 
test as a diagnostic measure. Familiarity with the test as it pro- 
ceeds, learning (ог, perhaps better, insight), and increased atten- 
tion and interest as the child comprehends the ‘‘idea’’ are prob- 
ably all operating throughout the testing situation. That this is 
especially true at the lower levels is seen in the increased percent- 
ages of children who started to discriminate, or at least to report 
correctly, after the initial ten trial series. 

Although these cases were not included in determining the re- 
liability of the test or in setting up tentative norms, it seems that 
the scores of those who ‘‘got the idea” later in the test can justi- 


fiably be compared with those who had it throughout if each ten 


trial series preceding the suecessful series is given a score of 9. In 
dealing with individual cases such a procedure is suggested. 
Reversals. — Another group of cases is made up of children who 
consistently recorded 5, 6, 7, 8, 9, and even 10 consecutive re- 
sponses wrongly out of a possible ten. The chances that a child could 
record five consecutive responses wrongly by guessing are 1 in 82, 
for six consecutive wrong responses 1 in 64, for seven 1 in 128, 
and so forth. Hence it seems unlikely that these consecutive wrong 
scores are due entirely to chance. More probably they are due to а 


reversal of the concept terminology. 
In the present test the following регее 


80 68 54 ЗІ Т 
14 13 13 


ntages of children were 


һу noting whether the responses after the successful 
ued to be correct or, because of the difficulty of the following in- 
t right fell off in а regular manner. Only four out of fifty-five 
sequences. These four cases were discarded. 


16 This was checked objectively 
ten trial series contin 
tervals, whether the per cen 
cases did not follow one of these 


58 IOWA STUDIES ІХ CHILD WELFARE 


found to score five or more consecutive wrong responses on one of 
the first three ten-trial series: 


Grades 
Children v IV III II т 


Per Cent 
Reversals (five or more consecutive wrong re- 


зропзез on amy one of first three ten-trial 

series) 17 19 93 85 19 
Reversals (nine ог ten eonseeutive wrong re- 

sponses on апу one of first three ten-trial 


series) 4 8 9 9 0 


The total number of cases Showing reversals (126 савез) 


sents 18 per cent of the 715 children who were given the 
tests. 


repre- 
group 


As might be expected, there is more reversing of сопсері as the 
age scale decreases except for the first grade, where the test was 
obviously of little value. This occurrence of reversals may account 
for the fact that the Larson norms begin below 50 per cent right 
responses. It is a factor which must always be considered in test- 
ing for pitch discrimination. 

Inattention, Disinterest, Fatigue. — There is а third elass of 
failures due to such factors as inattention, disinterest, fatigue, and 
зо forth. There are undoubtedly some seores in the present data 
which are low because of such factors. Probably very few scores 
were affected by fatigue in the physical sense, since the test re- 
quired little physical effort and was short. All that can be said 
concerning the influence of these factors on the present test is that 
they seemed to be fairly well controlled as evidenced, for example, 
in the reliabilities obtained on the test. 

Fourth Class of Failures. — The fin 
those children who could not discrimi 
30 d. v. No clear-cut eriterion b 
entiated has appeared to date. 


VALUE OF THE PRESENT GROUP TEST FOR CHILDREN 
IN FIFTH GRADE AND UNDER 


One of the objectives of this investigation was that of building 
a practicable test of pitch discrimination for younger children. 
Three considerations will be discussed in this connection: economy 
of time, expense, and minimal training for valid testing. 


al class of failures contains 
nate differences as large as 
y which this class could be differ- 


Economy of Time 


The present ‘test, having but five intervals and a total of fifty 
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trials, takes approximately seven minutes for aetual presentation 
after the period of instruetion has been completed. The whole period 
for testing and instruetions seldom lasts more than twenty minutes. 
It is quite possible, therefore, to double the length of the test where 


greater reliability is desired. 
Cost of Materials 

As the test is now constructed, the aetual eost for materials is 
approximately as follows: 


6 Deagan bars $ 9.00 
3 Resonators (built by author) 3.00 
1 Rubber-headed mallet 25 
Reeording blanks (per 500) 1.50 
Total eost $13.75 


There remains also the possibility of recording the test on а phono- 
graph dise, whieh would reduce the cost considerably for general 


testing purposes. 
Effectiveness With Minimum of Training 

The present test permits the inclusion of at least 90 per cent of 
the fifth, 83 per cent of the fourth, 64 per cent of the third, and 56 
per cent of the second grades as testable material with a minimum 
of training. With the exception of the first grade, profitable re- 
turns may be expected by using the group test. 


TENTATIVE NORMS 


Cases included in a tentative norm summary in this study have 
been aecepted only after an objective criterion of test comprehen- 
sion was satisfied. Such a validating procedure resulted in more and 
more exclusions as the grade scale descended. The following tabula- 
tion shows the deeline of available cases for normative purposes: 


Grade 
Cases v IV III II T Total 
Tested 136 157 204 158 65 715 
Used for Norms 97 100 102 48 4 351 
71 64 50 31 6 49 


Per Cent for Norms 


It was hoped that approximately 100 eases would be available for 


1: These percentages were obtained by adding the per cent of group test failures who 
were testable individually after a minimal training to the per cent testable in the group 
test, Since it was not possible to test all of the group failures individually, а random samp- 
ling of forty-eight was taken and the per cent of these who became testable after minimal 
training was used as an approximation of what testing a larger group might indicate. 
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each grade. But it soon beeame apparent that this was praetieally 
an impossibility in the first and second grades. 

Norms are also made diffieult because the scores on the tests are 
all relatively high due to the ease of the first twenty trials. The 
result is small range values for each grade (Table 16). 

Because of the above facts, the values presented in the tabula- 
lion on page 73 of Part Two of this monograph are given for only 
fifth percentiles. These norms are presented as tentative material. 


TABLE 16 


Group Distribution in the Final Pitch Test of Scores for Tentative Norms for 
the Fifth, Fourth, Third, and Second Grades 


Grade 
Raw Score 
У | ту | ІП | II 
50 2 3 1 1 
49 4 4 4 2 
48 7 8 3 1 
47 7 9 7 3 
46 7 10 10 3 
45 8 9 13 3 
44 11 6 4 3 
43 9 6 5 4 
42 5 3 12 6 
41 9 ff 6 0 
40 9 12 6 6 
39 8 5 (4 2 
38 5 1 6 5 
37 9 8 2 2 
36 1 1 2 1 
35 1 3 3 1 
34 1 2 4 0 
33 Хі 1 1 0 
32 
81 i 1 
30 2 1 2 
29 3 1 
98 1 
27 1 
26 
Cases 97 100 102 48 
Mean 42.95 41.95 41.58 41.00 
Standard Deviation 3.77 4.54 5.06 5.29 


GROUP DISCRIMINATION LIMENS 


Discrimination as Related. 10 Age and. Grade 

Тһе question of the signifieane 
for the different grade levels is 
to this question may be found f 


€ of the differenees in mean scores 
of theóretieal interest. An answer 
rom the raw data by determining 


MEASUREMENT OF MUSICAL DEVELOPMENT 61 


whether there are any real differences between the group seores on 
the hardest intervals of the test. The following tabulation shows 
that there are no significant differences between mean scores made 
by the several grades on the 4 d. v. interval; on the very hardest 
interval (2 d. v.) real differences begin to appear: 


Ratio of Difference to Standard Error of Difference* 


Between Total Between Group Between Group 


Comparison Group Seores Means on 4 ду. Means on 2 d.v. 


on all Intervals Interval Interval 
Grades V and ТУ 0.53 0.04 1.97 
Grades V and III 0.80 0.08 0.82 
Grades IV and III 0.38 0.13 9.30 
Grades V and II 1.31 1.83 2.31 
Grades IV and ІІ 0.57 1.72 3.63 
Grades III and II 1.02 1.75 1.66 


* Must ђе at least 3.00 to insure reliability of differences between means. 


Тһе fourth grade scored significantly more successes on the hard- 
est interval than the second grade. The differences between the 
mean scores for the fifth and second and for the fourth and third 
grades approach significance ; in both eases the higher groups made 
better scores. Since these results were obtained from cases all of 
whom, according to objective evidence, understood the test and were 
responding primarily to piteh differences, it is suggested that there 
are slight differences in discriminative ability at the different grade 


levels investigated in this study. 
SUMMARY 


A test of pitch discrimination for younger children has been con- 
structed in which the discriminative intervals, length, and instrue- 
tion, so far as could be experimentally demonstrated, were best 
adapted to children’s level. The test was given to 115 children in 


the first five grades of school. 


Analysis of the data shows that the test was a reliable and valid 


measure for all but the first grade. Although it was not applicable 
to all subjects tested, it did measure the abilities of 56 per cent of 
the second grade, 64 per cent of the third, 83 per cent of the fourth, 
and 90 per cent of the fifth grade under certain conditions (that is, 
with a minimum of training). It was most effective and reliable at 
the third and fourth grade levels. 

The most important limitation of the test is that its diserimina- 
tive value was not high enoügh at the easier and harder ends of the 
seale. A suggested alteration of the test to improve the range of 
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difficulty was made. The suggested change should not affect the 
reliability or validity of the test, exeept perhaps to raise the relia- 
bility slightly. 

Although individual tests are recommended below 
ineluding the second grade, it was show: 
termine by means of a group test the re 
erimination of most children by the ei 

Finally, the objeetive eriterion 


and perhaps 
n that it is possible to de- 
lative abilities in piteh dis- 
ghth year. 

of group test comprehension used 
in the present study deserves special mention since it clearly showed 
the unreliability of the examiner's observation as to whether or not 
subjeets were following directions carefully. Data secured on the 
basis of this eriterion were used in deriving tentative norms. These 


data suggest that pitch discrimination may improve slightly with 
age. 


СНАРТЕК У 


GENETIC GROWTH OF DIFFERENTIAL 
PITCH SENSITIVITY 


NORMATIVE SUMMARY OF DIFFERENTIAL REACTIONS 
TO SMALL PITCH DIFFERENCES FROM FOUR 
TO TEN YEARS OF AGE 


As a result of the experiments which have been presented in 
previous sections, it is now possible to present a pieture, or norma- 
tive summary, of certain aspects of the genetic growth of differ- 
ential pitch sensitivity. Such a summary brings together in or- 
ganized form the broader conclusions of this study. 


Four-Year-Old Children** 

If the “раше” is made interesting enough, the four-year-old 
child will sing readily. Only one child out of ten, however, has 
developed sufficient voco-motor control to respond in the right 
direction to simple two tone intervals. None can sing intervals ас- 
curately at this level. 

Although five out of ten ehildren ean use successfully a verbal 
concept of ‘‘going up — going down" in the visual field, only one 
in ten can use the terms successfully in the auditory field. 

Not more than two out of ten children at this level can report 
verbally or by singing whether they are able to diseriminate large 


piteh differences. 


Five-Year-Old Children 
АП ehildren at this level will sing readily. Every fifth child ean 
sing directionally and every tenth child сап sing accurately the 


interval heard. Five-year-old children can tell the experimenter the 


difference between ''going up — going down" in the visual field. 


Only two out of ten before extensive practice and four out of ten 
after extensive practice are able to report successfully in the audi- 


tory field. 


18 By a four-year-old child is meant а child from three years, six months to four years, 
five months of аве. The five-year-old child refers to а child four years, six months to five 
years, five months. The same description is true of older age groups. 
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Not more than four out of ten children at this level are testable 
for pitch discrimination either verbally or by interval singing. 


Siz-Year-Old. Children. (First Grade) 


All children at this level will sing readily, 62 per cent will sing 
directionally what they hear, and 14 per cent will sing accurately. 
Six-year-old children know the difference between “going пр — 
going down” in the visual field, but only two out of ten before 
extensive praetiee and five out of ten after extensive praetice ean 
use these terms successfully in the auditory field. | 

Slightly over half of the children at this age level are testable 
for pitch diserimination by means of the singing teehnique, with 
about one-fourth testable by means of the concept method. 


Seven-Year-Old Children (Second Grade) A 


Seven-year-old children will attempt to sing when asked to do 
so. Eight out of ten will sing directionally while four out of ten 
will sing accurately what they hear. 

Children at this age level understand the verbal сопсері of 
"going up — going down” in the visual field. Approximately five 
out of ten before extensive praetiee and six out of ten after ex- 
tensive practice can use the terms in the auditory field. There is a 
large increase in ability to use a verbal concept in the auditory 
field from the fifth to the seventh year. From the seventh year on 
the concept becomes more effeetive and singing less effective as 
methods for measuring piteh diserimination. 


FEight-Year-Old. Children (Third Grade) 


As at the previous levels, 


all children will sing when urged, but 
there are beginnings of resi 


Stance. The child begins to show some 
‚ Saying that he cannot sing very well or 
- Hight out of ten children sing direction- 
ally in the approved manner, and four out of ten sing accurately. 

All eight-year-old children understand a simple verbal concept of 
“going up — going down" in the visual field. Seven out of ten 
before extensive practice and nine out of ten after extensive ргас- 
Нее ean use such а concept in auditory discrimination. About Ше 
Same number of children are measurable by either the singing or 
verbal eoneept method; the latter is recommended. 


Group testing becomes really practical for the first time at this 
level, a majority now being testable by this means, 
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Nine-Year-Old Children (Fourth Grade) 

From this age on there is real resistance to singing in the ease 
of some boys. ‘‘I can’t sing"' accompanied by a negativistie atti- 
tude is not uneommon. The attitude does not represent the true 
picture, however, since eight out of ten children are able to sing 
direetionally what they hear, half of that number being able to 
sing accurately. 

Eight out of ten nine-year-old children before practice and nine 
out of ten after ргасіїсе are able to tell whether the sounds are 
going up or down with no embarrassment or resistance. The verbal 
concept method is especially recommended, therefore, for measur- 
ing purposes at this level, with the singing method as a supple- 
ment in individual cases. 


Ten-Year-Old. Children (Fifth Grade) 

Only one child out of ten is unable to sing directionally what he 
hears at ten years of age; six out of ten sing accurately what they 
hear. It is difficult, however, to secure codperation with the singing 
technique. 

Eight out of ten at this level understand the verbal concept in 
audition with preliminary instruetions; nine out of ten understand 
the сопсері after a minimal training period. Either method, the 
singing or the verbal еопсері method, is possible for individual 
measurement of ten-year-olds. Group measurement (using the соп- 
cept technique) is recommended іп practically all cases with indi- 
vidual tests for only those children failing the group test. 


IMPLICATIONS OF RESULTS 


As far back as the present investigator has been able to ‘‘push”’ 


techniques of differential measurement successfully, there is evi- 


dence in some cases of very fine pitch discriminative ability at the 


fifth year level. Additional evidence, however, indicates (so far as 
irrelevant factors could be objectively controlled) that there ap- 
pears to be some improvement in discriminative ability with age 


when large numbers of children were tested and average ability 


was determined. dk 
In dealing with children it is coneluded that the principles of 


psychophysical procedures ean be fairly closely adhered to, but that 
special precautions are necessary. By using fewer increments, by 
shortening the test, by introducing aids and objective checks on the 
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cognitive aspects of discrimination, and by introducing proper moti- 
vation, it was possible to present to children a series of paired tones 
differing by small amounts and to obtain results which сап be treated 
by aeceptable procedures. 

To the field of musieal development the contribution here has 
been limited since only one small section of a larger area has been 
investigated, that is, differential pitch sensitivity. A genetic ріс- 
ture of this one aspect of musical development, however, has been 
presented. It has been shown that measurement of pitch discrimi- 
nation is possible with some few children as early as the fourth 
year and that the majority of children are testable individually by 
the first year of school and in groups by the third school year. It is 
not so much the specific finding as the fact that practical meas- 
urement is possible at an early age which may be a contribution 
to the field of musical development. 

Finally, it appears that other tests of sensory capacities pat- 
terned in approach after the present one are now possible for chil- 
dren. It should be only a matter of time until such tests of musical 
capacity are available for those who desire them. 


ю 


. Раујоу, 


. Reymert, Martin 
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MANUAL OF INSTRUCTIONS AND IN TERPRE- 
TATIONS FOR A PITCH DISCRIMINATION 
TEST FOR YOUNG CHILDREN 


BY 


MELVIN S. HATTWICK 


MANUAL OF INSTRUCTIONS AND INTERPRE- 
TATIONS FOR A PITCH DISCRIMINATION 
TEST FOR YOUNG CHILDREN 


Part One of this monograph has presented an experimental 
evaluation of the methods applieable in testing piteh diserimination 
of young children aged four to ten years approximately. The 
present seetion presents а recommended proeedure for group and 
individual testing in the second, third, and fourth sehool grades. 
These are the levels at which the reliability of the test is known 


to be relatively high. 
MATERIALS NEEDED 


Ав constructed at present, the sound apparatus consists of three 
resonators, six metal bars (tuned to the following frequencies: 440, 
442, 444, 448, 457, and 470 d.v.), and a small rubber hammer or 
mallet. These are described in detail in Part One of this monograph, 
page 13. Of the intervals produced by this apparatus, 80, 17, 8, 4, 
and 2 d. v., some are easy enough for the poorest listener, yet others 
are difficult enough for all but the best listeners. 

For group testing Form С as deseribed on page 32 is the recom- 
mended record blank. This form is easily and economically mimeo- 


graphed. 
INSTRUCTIONS 


oughly familiar with the mechanics 
f practice in striking the bars are 
necessary before it is possible to produce two tones that do not 
differ appreciably in intensity. The bar should be struck so that a ` 
medium loud tone is heard. Each tone should last approximately 
one second, then should be damped by the finger. The second tone 
should be sounded as soon as the first is silent. 

The examiner should sit at the rear of the testing room when 
giving the test and should adapt the pace of the test to the slowest 
section of the group. After the interval has been presented, he 
should watch until all heads are raised slightly. It is well to have 


The examiner must be thor 
of the test. Several periods о 
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the regular teacher at the front of the room to wateh for copying, 
inattention, ete. 


The following specifie instructions should be followed as closely 
as possible: 


1. After the ehildren have filled in name and so forth on the blank, in- 
struct them as follows: **I want you to listen carefully. This is to be a hear- 
ing game. You will hear two sounds, one right after the other. Sometimes 
the last sound will be going up, and sometimes the last sound will be going 
down. Listen. ”? Demonstrate going up and going down. ‘‘Now if the last 
sound is going up you draw a line up the up road, like this.’? Demonstrate 
on Form C reproduced on the blackboard. “Ог if the last sound is going 
down, you are to draw a line down the down road, like this.’? Ilustrate. 

2. Then give preliminary practice on the largest interval, allowing the 
listeners to answer aloud as a group. Record correct responses on the form 
on the blackboard at the front of the room. Continue preliminary practice 
until all questions are answered, Repeat instructions once more just before 
the test is begun. That the demonstration and preliminary exercise must 
be given carefully and adequately cannot be emphasized too much. 

3. Pause at the end of each ten trial series. Have the children relax for 
а moment and commend or caution them depending upon their group at- 
tention. Remind them that ‘the next row, row number —, will be a little 
bit harder than the row you just finished. So sit up straight, listen very 
quietly, and see if you can get them all right. Remember, if the last sound 
is going up you go up the up road. 
go down the down road. Ready?” 


4. If the question is raised concerning the likeness of sounds, tell the 
children that there is always a difference. 


If the last sound is going down you 


COMPUTATION AND INTERPRETATION OF RESULTS 


The following key gives the standard order of stimuli for each 
trial; it also serves as a scoring key. The degree of difficulty of each 
interval in terms of d. v. and per cent of a whole tone is indicated 
at the left of each row. АП papers should be checked by this key, 


the score for each child being the total number of trials correct 
multiplied by two. 


Piteh Key 

Row Per Cent of Whole Tone Order of Stimuli* 

1 56 (30 d.v.) UUDUDDUUDD 
2 81 (17 d.v.) UDDDUUDDDU 
3 14 (8 dv.) DUDDDUUDDD 
4 7 (аа) UUDDUDDUDU 
5 4 (2 dv.) DUUDUUDUDU 
* U means up; D means down. 


AII test papers which (1) are ineom 


: plete, (2) have five or more 
consecutive wrong responses on any te 


n trial series, or (3) have less 
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than nine correct responses on the first ten trial series should be 
rejected. Such papers are not considered valid measures. (See Part 
One of this monograph, page 50.) Children who fail the group 
test may be tested individually later. 

All seores are interpreted in terms of percentile rank. Such a 
rank indieates the position whieh the child holds in a group on a 
scale of 100, 100 representing the highest rank, 0 the lowest, and 
50 the average. Since it is theoretically impossible for any individ- 
ual to have a percentile rank of 100 or 0, these are omitted. Be- 
cause of the small number of cases the norms of rank presented 
here are merely tentative. 

The following tabulation presents the norms. The first column 
gives percentile ranks for each fifth percentile. The second, third, 
and fourth columns represent the scores in per cent right for the 
several grades. For example, if a second grade child made a per cent 
score of 90 (score of 45 on test multiplied by 2), he would rank at the 
eightieth percentile. 

Tentative Norms for Pitch Scores 
Score, Per Cent Right 
Percentile Rank Grade 11 Grade ПІ Grade IV 


99 100 100 100 
95 96 97 98 
90 94 94 96 
85 92 92 94 
80 90 91 92 
75 88 90 91 
70 87 90 90 
65 85 89 89 
60 84 88 88 
55 83 86 87 
50 82 84 86 
45 81 83 85 
40 80 82 82 
35 79 80 81 
30 76 78 80 
25 74 76 78 
20 72 72 76 
15 70 71 75 
10 68 68 74 
5 64 64 68 
1 56 54 58 


INDIVIDUAL TESTS OF PITCH DISCRIMINATION FOR 
YOUNG CHILDREN 


f the children at the second grade level as 
d grade are not testable by the group 
e five or more con- 


Approximately half o 
well as many in the thir 
method. They do not complete their papers, hav 
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secutive wrong responses on any ten trial series, or have less than 
nine correct responses on the first ten trial series. 

After the group test papers are scored the examiner may test 
individually those who failed the group test. The same instructions 
are used in the individual test as in the group test. Now, however, 
the child is relieved of group distractions and the necessity of 
recording, since the examiner does the recording. The individual 
test should be given in a smaller room away from distractions and 
noise; the child should sit with his back to the examiner. The test 
may be informal, the child being told to ""ріау the game just as you 
did yesterday, except that this time you tell me whether the last 
sound is going up or down." Instructions are to be repeated until 
the child indieates he understands. 


DISADVANTAGE OF THE PITCH TEST 


Perhaps the greatest disadvantage of the piteh test for children 
described above is the method of sound produetion. Despite the ease 
of handling the equipment and presenting the series, it can hardly 
be expected that all examiners will strike the bells in the same 
manner or will exercise the same сате throughout the test. Since the 
produetion of sound intervals must be as earefully eontrolled as 
possible, a recorded test, especially with present excellent methods 
of sound recording, would undoubtedly be superior to the present 
test provided sueh recording takes into aecount the faetor of расе 
with young children. Recommendations for а recorded test for 
young children are presented in the Appendix. 
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А NOTE REGARDING THE PSYCHOPHYSICAL 
ANALYSIS OF PITCH DISCRIMINATION 
IN YOUNG CHILDREN 


А psychophysical analysis of the data on pitch discrimination 
collected by Hattwick (2) offers certain points of interest. The 
aspects of the problem to be considered in this note are (1) the 
determination of group thresholds by two methods, (2) the good- 
ness of fit of the empirical group curves to the normal probability 
integral, and (3) the distribution of individual thresholds. The 
determination of group thresholds yields information regarding 
normal pitch discrimination at the ages considered and evidence 
regarding growth with age. The estimation of the goodness of fit of 
the empirical curves offers evidence regarding the degree to which 
the mathematical assumptions of the constant method have been 
met in these data. The analysis of the distribution of individual 
thresholds may be used as a further check on the validity of the 
test as a measure of pitch discrimination. 


DETERMINATION OF GROUP THRESHOLDS 


By the Method of Constant Stimulus Differences 

The Hattwick (2) data follow the constant method in that a 
fixed series of repeated stimuli was used. They differ from the 
classical method in that the variable stimuli are all of the category 
‹‘отеабег,’” 80 that empirical values are given for only one-half of 
the eurve. This method necessitates scoring as ‘‘right’’ and 
‘wrong’ and sets the threshold at the 15 per сепі right point. 
ally called (4, p. 866) the *tmethod of constant 
U in which the standard is presented first or 
е data сап be treated mathematically by 
at least two procedures: (1) linear interpolation and (2) eurve 
fitting. A comparison between the results of these two methods with 
random numbers has been made by Linder (5) with results negligi- 


bly different. 
For the threshold anal 


This is now gener 
stimulus differences, 
last in random fashion. Th 


vsis only those cases meeting Hattwiek's 
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eriteria of comprehension were used, including those having 90 per 
cent right responses on the first stimulus and excluding those hav- 
ing five or more consecutive wrong responses. The percentages: on 
which these calculations were based were computed from the data 
of Table 15 of Hattwicks study (2, p. 54). 

The calculations for the method of linear interpolation follow 
the procedure given by Brown and Thomson (1, p. 60-61). The 


threshold values by grades for this method are given in the fol- 
lowing tabulation : 


Grade Group Threshold in 4. у. 
У 3.81 
ту 3.75 
ПІ 3.72 
ant 4.91 


By the Curve Fitting Method 


Since the method of linear interpolation uses only two empirical 
values and the Hattwick data give five points, the group thresholds 
were also calculated by a curve fitting method which makes it possi- 
ble to utilize all the data. For this purpose Kelley’s procedure (8, 
p. 326-330) seemed to be the most convenient. 

In this method the mean and standard deviation of the theoretical 
curves were determined, and the threshold (75 per cent correct 
judgments) located at a point one probable error above the mean. 
The tabulation below gives the statistical constants and the thres- 


hold values for the theoretical curves for the four grade groups 
considered : j 


Constants for Theoretical Curve Group 

Standard Probable Threshold 
Стаде Children Mean Deviation Error in d.v. 
№ 97 — 6.552 14.022 9.458 2.908 
IV 100 --7.188 16.354 11.031 3.298 
ПІ 102 --5.580 14.902 10.051 4.471 
11 48 --4.421 15.657 10.561 6.140 


The thresholds as determined by both methods give deereasing 
values as age increases. This corroborates Hattwiek's results on 
теап gross scores. The precautions taken to minimize the effeet of 
such factors as comprehension, attention, and interest in these 
samples of data have been discussed by Hattwick. To the degree 
that the test really measured pitch discrimination, it must be соп- 


cluded that there is a real but fairly small change with age in the 
function. 
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FIT OF GROUP DATA AND NORMAL PROBABILITY 
INTEGRAL 


Do the group data fit the theoretieal normal probability integral? 
Results on this point give a partial answer to the question of 
whether the assumptions underlying the constant method are sat- 
isfied by the data. The best answer to this problem would eome, 
of eourse, from the study of the data for individual eases, since 
group analysis involves the further assumption that the group re- 
sults are normally distributed. The data obtained on any indi- 
vidual were, however, far too meagre to justify the elaborate eurve 
fitting technique. Ап analysis of similar results for individuals by 
inspeetion and by the method of linear interpolation is given later 
in this article. 

Pearson's test of goodness of fit by the second method given in 
Brown and Thomson (1), when the successive percentages are rela- 
tively independent, was applied to the group data obtained by the 
curve fitting technique. The P-values, derived from Pearson’s 
tables (6), are given in the following tabulation, and the empirical 


and theoretical curves in Figure 1. 


Grade P-Value 
у; 148 
IV 328 
ПІ 252 
п .208 


: The fit, in most cases, 18 only fairly satisfaetory. Ав may be seen 
from Figure 1, however, the poorness of fit is not basically due to 
pronounced skew in the empirical data. Tendencies in the direc- 
tion of inversion seem to play a more prominent part. The goodness 
of fit does not vary systematically with age. This would suggest that 
any factors reducing the goodness of fit operate as effectively at 


one age as another. 


DISTRIBUTION OF INDIVIDUAL THRESHOLDS 
jvidual curves of percentages revealed 
three conditions in the data: (1) the conventionally adequate curve, 
(2) curves containing inversions, and (3) eurves indicating that 
the test was not sufficiently difficult, that is, curves which did not 
cross the 75 per cent line. Figure 2 gives a sample curve of each 
form from the data. The relative frequencies of the three types of 
curve by grades are given in the following tabulation : 


Ап examination of the ind 
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Conventional Type Inversions Too Easy 
Grade Children Number PerCent Number PerCent Number Per Cent 
v 97 61 63 12 12 24 25 
ту 100 22254 54 22 22 24 24 
TEL 102 73 72 14 14 15 14 
II 48 38 79 2 4 8 17 


The category ‘‘inversions’’ includes only cases where the inversion 
erossed the 75 per cent value. The inversions are пої particularly 
significant in such a small sample of trials per child. The number 
of children for whom the test seemed too easy is surprising and 
points to the need for a finer instrument for the measurement of 
some cases even at these ages. The mean and standard deviation of 
individual thresholds, determined by linear interpolation from the 
group of conventional curves, are given in the following tabulation: 


Grade Mean Threshold Standard Deviation 


Yy 5.00 3.32 
IV 6.13 5.63 
ПІ 6.18 5.48 

п 7.55 7.43 


Figure 3 shows the distributions. 


The appearanee of a substantial variation in individual thres- 
holds supports the view that the test was genuinely measuring 
piteh diserimination, since individual variation in piteh sensitivity 
within this selected sample of eases would be expected. The obvious 
Skew is partly a funetion of the failure of the test to distribute 
scores below 2 d. v. 


The applicability of psychophysical procedures to young children 
rests essentially on the basis) of their ability to compare repeatedly 
two things under relatively constant instruction. Simple choices 
are continually being made by children even at the preschool 
level. To cast these choices into patterns which will be adequate 
for psychophysical analysis is another matter, The procedure used 
by Hattwick seems to have been fairly successful at the ages given. 
While these data are suggestive, it is probable that the degree to 


which the necessary conditions are met will have to be determined 
for each new problem, 
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IMMEDIATE AND DELAYED MEMORY OF 
PRESCHOOL CHILDREN FOR PITOH 
IN TONAL SEQUENCES 


ву 


Hanorp М. WILLIAMS 


IMMEDIATE AND DELAYED MEMORY OF 
PRESCHOOL CHILDREN FOR PITCH 
IN TONAL SEQUENCES 


It has been the purpose of this study to estimate the degree of 
relationship existing at the preschool level between what may be 
called immediate and delayed recall of piteh sequences. А subsid- 
іагу purpose was to measure the effectiveness of a fairly intensive 
course of training in delayed recall. Vocal accuracy in immediate 
reproduetion of the sequences in the test for tonal reproduction 
previously reported by Williams (3, p. 70-79) served as the eri- 
terion for immediate recall. Aceuracy in the independent singing 
of comparable sections from learned nursery songs served as the 
criterion for delayed recall. While voco-motor control is necessarily 
а factor in the use of these criteria, it nevertheless сап be considered 
a constant in the comparisons made here. 


EXPERIMENTAL CONDITIONS 


For the immediate reproduction the voice was used as the stim- 
ulus. The child was instructed to sing the brief tonal sequences 
immediately after the experimenter. For the delayed reproduction 
a pitch pipe was used to give the keynote. The children were sim- 
ply instructed to sing the songs, starting at the given pitch. All 
stimuli and reproductions were recorded on the dictaphone. The 
records were later transcribed by ear to a special musical score. The 
recording was to the nearest half-tone.' 

The training period was informal but controlled with respect to 
the number of practices given the children. The group method of 
training was used, the teachers presenting the song voeally with 
the piano as accurately as possible and then inviting the children to 
join them in singing it. The efforts to obtain participation were 
quite generally successful for all members of the group. The songs 
were in every case unfamiliar to the children at the beginning of the 
training period. Two practices a day were given daily for each 

1 The validity of this method has already been discussed by Hattwick (1, р. 177-181). 


Тһе experimenter (Hattwick) had nlready shown an accuracy of vocal control and pitch 
discrimination approaching the accuracy of the machine itself. 
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song. The test songs were introduced, along with other songs, into 
а regular singing period. This plan was followed in order to avoid 
as far as possible the effects of boredom. The training period in- 
cluded a total of sixty practices, extending over thirty school days. 
This period of six weeks seemed to represent the maximum time 
which would allow a sufficient number ої praetiees while partially 
avoiding the effects of boredom and maturation. 

While three songs were introduced for the purposes.of the experi- 
ment, the data from only one were complete enough to warrant 
statistical study. This was ““Coek-a-Doodle-Doo,”’ the score for 
which is given in Figure 1. This song was taught to the five- and 
four-year-old groups of the Iowa Child Welfare Research Station 
and the University Elementary School. Due to absences, the num- 
ber of children available for the various computations varied. The 


mean age of the forty-one children in this study was 60.3, the range 
47 to 70 months. 


Figure 1, Musical Seore of Test Song in Delayed Recall Series 


Objective measurements on achievement (by the diet 
nique) were made before training and 
sixty practices. The children were tested individually in a small 
testing room. Since they had already been taught the words of the 
songs, the preliminary tests were conducted by h 
menter sing each song to the child in se 
brackets in Figure 1, and requiring the 
immediately after him. Т 
sentially a test of immedi 
approach, however, seem 
further means of measur 
recall. 


aphone tech- 
after ten, thirty-one, and 


сПопв as indicated by the 
child to sing each section 
hat this first presentation represented es- 
ate reproduetion is logieally obvious. This 
ed the most reasonable, since it offered a 
ing the reliability of the test of immediate 


METHOD OF SCORING 


Two types of seore are Possible from the records. In the first 


plaee, one сап simply record the number of correct responses, tone 


aving the experi- 
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by tone (correct responses). A second weighted score may be ob- 
tained by recording for eaeh tone the amount of error in half-tone 
units and assigning the cumulative total as a score (error score). 
According to the latter method of scoring, the near monotone, for 
example, will be more heavily penalized than the child who makes 
only small or oceasional errors. The correlation on the reproduction 
(Williams) series between the correct and the error score was 
—.791, probable error .040, for forty children on the first adminis- 
tration and —.974, probable error .007, for twenty-seven children 
on the second administration. This indicated a substantial similarity 
in result from both methods of scoring. In this study the weighted 
error scores were used throughout. 

The weighted error scores of the delayed memory series may be 
computed from the standard tone as a base or from the child’s own 
keynote. The scoring for the delayed recall series was computed 
from the notes indicated by the first and last braekets in Figure 1, 
omitting the middle group. 3 

LEARNING CURVES 


The group means and standard deviations for the error seores for 
the tests of delayed reeall are given in Table 1 for two groups, one 
of twenty-one children having complete records for the whole series 
and one of thirty-one ehildren with reeords on only the ten and 
thirty-one trial practice periods. The scores are tabulated from two 
bases, the stimulus and the ehild's own keynote. 

The general characteristics of the learning eurve represented 


TABLE 1 
Group Error Seores in Delayed Reeall for Children Completing the ‘Training 
Series 
| 21 Children* 31 Children** 
Test Standard Standard 

Mon Deviation Mean Deviation 

Standard Stimulus 
Preliminary 51.6 214 58.9 26.0 
After 10 practices 53.9 30.0 57.9 28.6 
After 31 practices 37.6 39.1 38.0 35.3 
After 60 practices 34.0 25.0 

Child's Own Keynote 
Preliminary 60.0 31.6 58.5 33.3 
After 10 practices 42.6 26.6 46.3 26.0 
After 31 practices 30.0 28.0 32.2 28.6 
After 60 practices 44.7 37.6 


* Mean Age, 61.8 Months; Age Range, 47-69 Months. 
** Mean Age, 61.9 Months; Age Range, 47-70 Months. 
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in the data of Table 1 may be seen from Figure 2. That the first 
presentation represented largely a test of immediate reproduetion 
has already been pointed out. It will be remembered that on all 
subsequent tests the child sang unaided except for the sounding of 
the keynote. Under these circumstances the lack of improvement, 
as shown in Figure 2, after ten trials is not surprising. The poorer 
section of the group (+ 1 standard deviation) aetually did less 
well at the tenth trial, although the better section (— 1 standard 
deviation) was more accurate at the tenth trial. This indicates the 
rapidity of the learning in the better group. The age means and 
ranges for the groups of thirty-one and twenty-one children each 
are practically identical, showing that the data from the two 
groups are quite comparable as far as selection in terms of age is 
concerned. The loss in numbers resulted entirely from the acciden- 
tal circumstance of absence on account of illness or other factors. 

Marked improvement in absolute error scores is shown from the 
tenth to the thirty-first trial, as may be seen from Figure 2. The 
mean improvement from the thirty-first to the sixtieth trial is much 
less. By the thirty-first trial the better section had learned the song 
to practical perfection and actually did worse at the sixtieth trial. 
This is interpreted as the result of boredom with the song. 

The curves caleulated from the standard and from the child's 
own keynote show one Systematie difference іп that from the be- 
ginning there is improvement in the latter. Seores obtained by eal- 


eulating from the child's own keynote reveal the aequisition of a 


lated from the standard, 
might still yield a large 
d that the curve for the 
in the aequisition of the 


children who were given both tests, 


Children First Test Final Test 
Mean Error Seore 31 107.4 113.4 
Standard Deviation 21 631 69.9 


The praetice in learning the melody resulted in no group gain on 
e test for reproduction. The slight loss may be attributed to bore- 
om and may parallel the loss in learning achievement from the 


th 
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thirty-first to the sixtieth trial. There is no evidence here of trans- 
fer from one ability to the other. These results are in agreement 
with the findings of Jersild and Bienstock (2). 


CORRELATIONS BETWEEN IMMEDIATE AND DELAYED 
RECALL 


The reliability for an immediate repetition of the Williams test 
for tonal reproduction has been reported (3, p. 78) as being of the 
order of .90 or better for groups comparable to those used in the 
present study. The correlation for the ""еггог зеоге'' between the 
initial and final applieation of the test after the training period 
reported here was -698, probable error .066, for twenty-seven chil- 
dren. The eorrelation for the ""соггесі response” score was 774, 
probable error .052, for the same children. 

There seems to be, therefore, a decrement in reliability or pre- 
dictive power of the test for immediate reproduction when this is 
measured over a six week period such as that covered in the present 
Study. Ап inspeetion of individual scores showed that gains, losses, 
and no change in зеоге оп the test were about equally distributed. 
T'he split-half reliability of the test of reproduetion in the delayed 
recall series (first applieation of the test) was .869, probable error 
.029, for thirty-three eases. By the Spearman-Brown formula this 
reliability became .929. 

The correlations between the error scores for the Williams tests 
of immediate геса and the tests of delayed recall 
Table 2. 


On the basis of these results, it may be said that the 


are given in 


reliability 


TABLE 2 
Correlations Between Tests of Immediate and Delayed Reeall for Thirty-One 
Children 
First Second 
Test 
т Р.Е. т Р.Е. 
Standard Stimulus 
Preliminary .536 .086 .595 .078 
After 10 trials .582 .080 566 082 
After 31 trials .630 .073 .806 .048 
Child's Own Keynote 
Preliminary 187 117 301 110 
After 10 trials .274 112 .388 202 
After 31 trials .281 4111 -560 .083 
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of the test of immediate reproduetion of piteh is only slightly 
greater, after a six week interval, than the correlation of this test 
with scores on delayed recall after practice, when both are measured 
from a given standard tone. These correlations drop materially 
when the scores after practice are measured from the child’s own 
keynote. This is interpreted as evidence of a considerable degree 
of independence between two abilities, one of which may be de- 
seribed as the immediate reproduction of pitch sequences and the 
other as the ability to carry in memory а tonal pattern. The differ- 
ences already shown between the learning curves calculated from 
the standard stimulus and from the child’s own keynote are con- 
sistent with this finding. 

The correlations between scores on the Williams tests of immedi- 
ate reproduction and successive gains in the learning scores were 
also calculated for the thirty-one children who completed thirty- 
one practice trials. The results are given in the following tabula- 


tion: 
Gain at Gain of Thirty-First 
Williams Test Tenth Trial Over Tenth Trial 
г Р.Е. г РЕ. 
Standard Stimulus 
I .097 .120 131 119 
п .294 111 .385 110 
Child's Own Keynote 
I —.051 4191 055 121 
II --.164 118 .223 115 


These аге, in a sense, the crucial eorrelations and add further 
evidenee of the relative independence of the two abilities. 


CONCLUSIONS 


The following conclusions are drawn from the data presented. 

1. Under eontrolled eonditions, great individual differenees ex- 
isted in the ability of this group of young children in immediate 
reproduetion of tonal sequences and in ability to learn to repro- 
duee a simple melody in delayed recall. Both abilities ranged from 
an achievement which was little more than what might be achieved 
by random effort to a practically perfect rendition. 

2. Group improvement in delayed recall was apparent over the 
4. There was some indication of more rapid early im- 
arning of pattern than in the absolute reproduc- 
a slight loss as the result of prolonged prac- 


practice perio 
provement in the le 
tion of pitch. There was 
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tice. There was no evidence in the group results of transfer to imme- 
diate reproduction of pitch. 


3. A test of immediate recall prediets the relative position of 
children after varying degrees of training with correlations of the 
order of .60 when these results are calculated from the given 
standard tone as a base. These correlations drop to the order of .20 
when the child’s own keynote is used as a base. The correlations 


between ability in immediate reproduction and gain in delayed 
recall were negligible. 


4. These results suggest that there is a considerable degree of in- 
dependence between ability in immediate and ability in delayed 
reproduction of pitch sequences at the preschool level. 
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APPENDIX 


APPENDIX 
EXAGGERATION OF INTERVALS 


Twenty-two preschool children, ages four years, two months to 
five years, eleven months, who had scored at least twenty-three out 
of twenty-five correct responses оп the intervals of the first main 
interval singing test and who could sing directionally what they 
heard in the Ediphone test were later given individual tests of pitch 
discrimination. The twenty-five trials used bells as stimuli; the 


following intervals were used: 21, 18, 14, 10, 6, 4, 3 d. v. 
Only nine of the twenty-two ehildren exaggerated either the 
to a perceptible extent. As far as the 


larger or smaller intervals 
experimenter eould tell, thirteen sang accurately even on the smaller 
intervals. The examiner could not determine reliably whether the 


intervals sung below 3 d.v. were sung on piteh or not since the 


exaggerated differenees were 50 small. 


AND INSTRUCTIONS FOR THE SECOND 
INTERVAL SINGING TEST 


Each child was brought into the room and seated at a table on 
the opposite side from where the experimenter sat. An informal 
period of a few minutes was devoted to establishing rapport. 

After eight trials of illustrated instruetions the child was given 
a ten trial test. If he responded correctly on nine of the ten trials, 
he was considered testable for pitch discrimination. If he scored 
Jess than nine successes on ten trials, the experimenter gave him 
ten practice trials on the 30 d.v. interval, recording the verbal 
responses. If the child made a mistake in singing, the experimenter 
said, “Хо, the bells didn’t ring ‘ding-ding.’ Listen." The experi- 
menter rang the bells; then sang like the bells. ‘‘That’s the way 
the bells rang. Now we'll try some more, and be sure to sing just 
like the bells." At the end of the ten trial practice period a test 
series of ten trials was given. If the child still scored less than nine 
out of ten successes, he was given another practice series of ten 


trials and a final series of ten test trials. 


PROCEDURE 
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TYPICAL PROCEDURE AND INSTRUCTIONS USED IN 
GROUP TESTING 


The children were seated alternately and provided with pencils 
and appropriate blanks. Instructions were as follows: 


“бота Up — Going Down? Terminology 


"I want you to listen very carefully beeause this is going to be a 
hearing game.’’ The word test was used in plaee of game for the 
fifth grade. ‘‘You will hear two sounds, one right after the other, 
coming from the Victrola. Sometimes the last sound will be going 
up, sometimes it will be going down. Listen.” The examiner played 
four intervals, stating the correct response after each. Now be- 
fore we start the real game we'll have some practice trials to show 
you how it goes. I'll play some sounds and you will tell me which 
way they are going, up or down, and I'll write the answers on the 
large form here on the board. Now listen carefully.” 

The examiner presented at least five intervals saying after the 
group response, ““ Үез, going up (or down)” and writing “U” or 
“D” in the correct square of the reproduced form on the black- 
board. At the end of the practice period he said to the children, 
“Поев every one understand how to play the game? Don't be 
afraid to raise your hand if you have questions." (Pause) Usually 
there were several who had questions, and these were answered 
before proceeding with the actual test. The questions were recorded. 

The examiner then said, “Мол take your peneils and get ready 
to start on row number 1. Go right aeross the page just as you do 
in reading. If the sounds are going up, write a U (illustrating) in 
the square; if the sounds are going down, write a D (illustrating) 
in the square. Remember the rest of your grade was just in here 
(or will be in here next) and I want to see if you ean do as well 
as they did (or will do). Ready?” 

After each ten trials the test was stopped, and the examiner 
said, Remember we have to be very quiet so everyone ean hear 
well. Now start on row number 2. If the sounds are going up, mark 
U in the square, if they are going down, mark D. Ready? Very 
quiet now.” 

A description of Forms A, B, and C are given on p 
The procedure for recording Forms B and C w 
above exeept for the necessary ehanges in verbal i 
the method of recording. 


age 32. 
as similar to the 
nstruetions due to 
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«‹ Higher-Lower"! Terminology 

Instruetions paralleled those for “going up — going dom? ex- 
cept for the substitution of ‘‘higher-lower’’ wherever ‘‘going up — 
going down" occurred. 


A RECORDED TEST OF PITCH DISCRIMINATION FOR 
YOUNG CHILDREN 


of pitch discrimination for 


The advantages of a recorded test 
of test administration, (2) 


young children are (1) simplicity 
AN Gan of sound production, and (3) eventual monetary 
vanced n Hs disadvantages are (1) introduetion of extraneous 
еа 554 (іп comparison to the sound produced by а tuning 
" ar) and (2) difficulties in varying the pace of the test 
o meet the demands of children's groups. 

Present methods of recording reduce the first disadvantage to а 
minimum, but the last is more diffieult to overcome. ТЕ is possible, 
however, to make a reeorded test whieh will meet the requirements 
of the present bell test to à large degree. In addition, if proper 
arrangements were made, the children’s test could be recorded on 
the same dise as that for adults, as suggested in the following para- 


graphs, 

The present Seashore te 
tervals in too rapid succession for С 
fourth, third, and second grade children. 
average time taken by the groups when set 
the test with that of the present recorded test i 
additional three or four seconds between each test 
the test so that every child would have time to T 
individual trial. Such an extension between trials shou 
іп й less reliable test for adults. 
Pardo the Seashore pitch test run 

g from one side of the record to t 


Ди to give selected series from the 
2, 8, and 2 d. v. ten trial series without the other series). Ша 


narrow nonreeorded section were left between each ten trial series, 
the examiner could then choose the intervals for the children’s test 
pes ten-trial series) from the adult test (еп ten-trial series). 
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made during this intermission and instructions repeated before 

` going to the following series. The value of breaking up the fifty 
trial test into units of ten trials eaeh has been diseussed and recom- 
mended in the first study in this monograph. 

The revised test of piteh diserimination for adults and older 
children to be recorded soon does not inelude the 30 or 17 d.v. 
intervals of the test for ehildren suggested here. A 15 d. v. interval 
might be substituted for the 17 d. v. interval used in the present 
test, but it is strongly recommended that the large 30 d. v. interval 
be ineluded in the test. This interval has little diseriminative value, 
but for demonstrative and instruetional purposes with children a 
large easily heard interval is strongly recommended. 

By allowing a few seconds more time between individual trials 
and by placing a narrow nonrecorded section between each ten 
trial series on the record, the forthcoming revision for adults and 
older children may well be used as a test for younger children. In 
addition to permitting a short group test for children (fifty trials, 
using the 30, 15, 8, 4, and 2 d. v.), such a recording would permit 
the further advantage of concentrating on the important intervals 
in individual tests to the exclusion of other intervals. 


